shareAGI / alignment-handbook-cn
中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.
☆11Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for alignment-handbook-cn
- pretrain a wiki llm using transformers☆10Updated 2 months ago
- Zero-human, cold-start construction of long-chain agents in professional domains☆11Updated last week
- LLM RAG 应用,支持 API 调用,语音交互。☆10Updated 4 months ago
- ☆13Updated 5 months ago
- ☆13Updated this week
- SUS-Chat: Instruction tuning done right☆47Updated 10 months ago
- 用大模型批量处理数据,现支持各种大模型做OCR,支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆9Updated 2 months ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆11Updated 9 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 7 months ago
- accelerate generating vector by using onnx model☆12Updated 9 months ago
- ✅4g GPU 可用 | 简易实现ChatGLM单机调用多个计算设备(GPU、CPU)进行推理☆34Updated last year
- 通义千问的DPO训练☆27Updated 2 months ago
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆25Updated 4 months ago
- Qwen-Efficient-Tuning☆42Updated last year
- 大模型检索增强生成技术最佳实践。☆46Updated 2 months ago
- GraphRAG在2024.11.5发布 0.4.0新版本,引入增量更新索引和DRIFT图推理搜索查询,本项目对新增的两个新功能进行全面测试,并提供了一种支持多类型大模型使用GraphRAG解决方案,不仅支持GPT大模型,还支持本地大模型(Ollama)、阿里云通义千问、百…☆13Updated 2 weeks ago
- 通用简单工具项目☆14Updated last month
- ☆11Updated 9 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆44Updated 6 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆55Updated 2 months ago
- rwkv finetuning☆36Updated 7 months ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated 6 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 7 months ago
- vanna.ai demo☆17Updated 6 months ago
- Reinforcement Learning Toolkit for RWKV. Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning Let's boost the model's int…☆19Updated this week
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆42Updated this week
- ☆16Updated 11 months ago
- Music large model based on InternLM2-chat.☆21Updated 4 months ago
- ☆22Updated last month