shareAI-lab / alignment-handbook-cn
中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.
☆11Updated 7 months ago
Alternatives and similar repositories for alignment-handbook-cn:
Users that are interested in alignment-handbook-cn are comparing it to the libraries listed below
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 2 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated 9 months ago
- Music large model based on InternLM2-chat.☆22Updated 4 months ago
- 通义千问的DPO训练☆46Updated 7 months ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- MLLM @ Game☆11Updated 3 weeks ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆12Updated 6 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- LLM Tokenizer with BPE algorithm☆31Updated 11 months ago
- Deepseek-r1复现科普与资源汇总☆21Updated last month
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆52Updated 3 months ago
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆11Updated 5 months ago
- 通用简单工具项目☆16Updated 6 months ago
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Updated 6 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- ☆17Updated 10 months ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆29Updated 9 months ago
- ☆26Updated 6 months ago
- A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using …☆10Updated last week
- ☆23Updated 6 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 10 months ago
- ☆16Updated 10 months ago
- ✅4g GPU可用 | 简易实现ChatGLM单机调用多个计算设备(GPU、CPU)进行推理☆34Updated 2 years ago
- ThinkLLM:🚀 轻量、高效的大语言模型算法实现☆37Updated last week
- ☆94Updated 4 months ago
- 中文领域心理健康对话大模型simpsybot☆41Updated 4 months ago
- ☆19Updated 9 months ago