linjh1118 / Llama3-Chinese-ORPO
A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO
☆17Updated 9 months ago
Alternatives and similar repositories for Llama3-Chinese-ORPO:
Users that are interested in Llama3-Chinese-ORPO are comparing it to the libraries listed below
- A simple way to synthesize LLM training data. (under construction⚠)☆16Updated last month
- The newest version of Llama3, with source code explained line by line in Chinese☆22Updated 10 months ago
- simpsybot, a large language model for Chinese-language mental health dialogue☆30Updated 2 months ago
- An LLM RAG application that supports API calls and voice interaction.☆11Updated 7 months ago
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆112Updated 2 months ago
- Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat☆53Updated 9 months ago
- Pre-training Llama3 using Chinese☆13Updated 9 months ago
- A survey of large language model training and serving☆36Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆28Updated 8 months ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆59Updated 6 months ago
- ☆91Updated 2 months ago
- Copy the MLP of llama3 8 times as 8 experts, create a router with random initialization, add load balancing loss to construct an 8x8b Mo…☆26Updated 7 months ago
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆120Updated last month
- ☆22Updated 4 months ago
- Fine-tune large language models with the DPO algorithm; simple and easy to get started.☆30Updated 7 months ago
- ☆16Updated 7 months ago
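- Fine-tuning based on ChatGLM2-6B, including full-parameter, parameter-efficient, and quantization-aware training; supports instruction fine-tuning, multi-turn dialogue fine-tuning, and more.☆25Updated last year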
- A tool for generating a qwen2 MoE model from a Qwen2 (Qwen1.5) model☆13Updated 10 months ago
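- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on further fine-tuning of LLMs.☆32Updated 6 months ago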
- RWKV fine-tuning☆36Updated 10 months ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆45Updated 5 months ago
- Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2☆54Updated last year
- PICA, a multi-turn empathetic dialogue model☆89Updated last year
- LLM+RAG for QA☆21Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆64Updated last year
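- Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions☆56Updated last week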
- A repo showcasing the use of MCTS with LLMs to solve gsm8k problems☆49Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 4 months ago
- Large language model fine-tuning: bloom, opt, gpt, gpt2, llama, llama-2, cpmant, and so on☆96Updated 9 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 8 months ago