owenliang / qwen-dpoLinks

通义千问的DPO训练

☆56

Alternatives and similar repositories for qwen-dpo

Users that are interested in qwen-dpo are comparing it to the libraries listed below

Sorting:

cwxndl / LLM
大语言模型应用：RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛
☆71Updated 8 months ago
yuanzhoulvpi2017 / SentenceEmbedding
☆119Updated last year
owenliang / bpe-tokenizer
LLM Tokenizer with BPE algorithm
☆43Updated last year
LDLINGLINGLING / adan_application
一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR
☆191Updated 2 months ago
liujunwen23 / MIRE
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
☆125Updated 11 months ago
heyblackC / BetterMixture-Top1-Solution
天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案
☆32Updated last year
akaihaoshuai / baby-llama2-chinese_cybertron
使用单个24G显卡，从0开始训练LLM
☆56Updated 3 months ago
percent4 / llm_math_solver
本项目用于大模型数学解题能力方面的数据集合成，模型训练及评测，相关文章记录。
☆95Updated last year
Sshuoshuo / easy-rag
快速入门RAG与私有化部署
☆209Updated last year
taishan1994 / Llama3.1-Finetuning
对llama3进行全参微调、lora微调以及qlora微调。
☆210Updated last year
KMnO4-zx / TinyRAG
TinyRAG
☆353Updated 4 months ago
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆68Updated last year
aJupyter / ThinkLLM
ThinkLLM：🚀 轻量、高效的大语言模型算法实现
☆105Updated 5 months ago
limafang / tiny-graphrag
☆43Updated 5 months ago
Mxoder / LLM-from-scratch
一些 LLM 方面的从零复现笔记
☆228Updated 6 months ago
AI-Study-Han / Zero-Qwen-VL
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
☆74Updated last year
muyaostudio / qwen2_seq_cls
使用 Qwen2ForSequenceClassification 简单实现文本分类任务。
☆83Updated last year
kanhaoning / RAG-Optimization-Practices
☆70Updated 3 months ago
limafang / agent-arxiv-daily
🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文（已附带中文摘要翻译）
☆31Updated last week
percent4 / embedding_model_exp
本项目用于Embedding模型的相关实验，包括Embedding模型评估、Embedding模型微调、Embedding模型量化等。
☆64Updated last year
RethinkFun / LLM
☆110Updated last year
owenliang / agent
qwen ai agent
☆139Updated last year
dawoshi / Tianchi-LLM-QA
阿里天池: 2023全球智能汽车AI挑战赛——赛道一：AI大模型检索问答 baseline 80+
☆114Updated last year
Logistic98 / rag-omni
基于大语言模型的检索增强生成RAG示例
☆161Updated 5 months ago
yang19527 / AwesomeInterview
包含程序员面试大厂面试题和面试经验
☆192Updated 5 months ago
Glanvery / LLM-Travel
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆347Updated last year
waylandzhang / DeepSeek-RL-Qwen-0.5B-GRPO-gsm8k
☆84Updated 8 months ago
yongzhuo / Qwen-SFT
阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理
☆122Updated last year
AI-Study-Han / Zero-Chatgpt
从0开始，将chatgpt的技术路线跑一遍。
☆264Updated last year
chunhuizhang / llm_rl
llm & rl
☆236Updated last week