shareAGI / alignment-handbook-cn

中文版hf-alignment-handbook，大模型全套sft、dpo、orpo、cpt训练教程.

☆11

Alternatives and similar repositories for alignment-handbook-cn:

Users that are interested in alignment-handbook-cn are comparing it to the libraries listed below

1100111GTH / XG-RAG
LLM RAG 应用，支持 API 调用，语音交互。
☆11Updated 8 months ago
limafang / Xtuner-GUI
Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…
☆13Updated last year
linjh1118 / Llama3-Chinese-ORPO
基于Llama3，通过进一步CPT，SFT，ORPO得到的中文版Llama3
☆17Updated 10 months ago
GuoYiFantastic / IMelodist
Music large model based on InternLM2-chat.
☆22Updated 3 months ago
wux-labs / OpenXLab-IntelligentSalesAssistant
☆17Updated 9 months ago
limafang / tiny-graphrag
☆36Updated 3 months ago
360AILABNLP / 360LayoutAnalysis
☆26Updated 5 months ago
seanzhang-zhichen / Qwen-WisdomVast
Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …
☆18Updated 11 months ago
ZBayes / poc_project
通用简单工具项目
☆17Updated 5 months ago
t6am3 / law_glm_baseline
☆15Updated 9 months ago
owenliang / qwen-dpo
通义千问的DPO训练
☆40Updated 6 months ago
jackfsuia / LLM-Data-Cleaner
用大模型批量处理数据，现支持各种大模型做OCR，支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…
☆13Updated 6 months ago
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆54Updated 10 months ago
scchy / XtunerGUI
Xtuner Factory
☆33Updated last year
ai-in-pm / rStar-Math
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
☆38Updated 2 months ago
Lightblues / AgentRE
Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".
☆62Updated 7 months ago
yanqiangmiffy / tree2retriever
Recursive Abstractive Processing for Tree-Organized Retrieval
☆11Updated 9 months ago
yongzhuo / gemma-sft
Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆29Updated 10 months ago
yaosenJ / CoalQA
使用煤矿历史事故案例，事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据，微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。
☆47Updated 2 months ago
ArtificialZeng / llama3_explained
the newest version of llama3，source code explained line by line using Chinese
☆22Updated 11 months ago
heyblackC / BetterMixture-Top1-Solution
天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案
☆27Updated 8 months ago
360CVGroup / 360VL
Our 2nd-gen LMM
☆33Updated 10 months ago
ChaimEvans / ChatGLM_MultiGPUCPU_eval
✅4g GPU可用 | 简易实现ChatGLM单机调用多个计算设备（GPU、CPU）进行推理
☆34Updated last year
amulil / vector_by_onnxmodel
accelerate generating vector by using onnx model
☆15Updated last year
WalkerMitty / PDFparser
Here is a demo for PDF parser (Including OCR, object detection tools)
☆34Updated 5 months ago