linjh1118 / Llama3-Chinese-ORPO
基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3
☆17Updated last year
Alternatives and similar repositories for Llama3-Chinese-ORPO:
Users that are interested in Llama3-Chinese-ORPO are comparing it to the libraries listed below
- pre-training llama3 using chinese☆14Updated 11 months ago
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆33Updated 8 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 10 months ago
- 通义千问的DPO训练☆46Updated 7 months ago
- A fluent, scalable, and easy-to-use LLM data processing framework.☆17Updated last week
- ☆94Updated 4 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 根据Qwen2(Qwen1.5)模型生成qwen2 MoE模型的工具☆16Updated last year
- ☆46Updated 10 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆72Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆174Updated last week
- SUS-Chat: Instruction tuning done right☆48Updated last year
- ☆23Updated 6 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆83Updated 7 months ago
- ☆32Updated last week
- Agentic RAG R1 Framework via Reinforcement Learning☆26Updated this week
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆26Updated 9 months ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆97Updated last year
- 大语言模型训练和服务调研☆37Updated last year
- ☆26Updated 5 months ago
- PGRAG☆48Updated 9 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 2 months ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆22Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆64Updated 9 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆47Updated 2 months ago
- From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation☆88Updated last month
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆66Updated 2 months ago
- ☆38Updated 5 months ago