owenliang / qwen-dpo
DPO training for Tongyi Qianwen (Qwen)
☆63Sep 21, 2024Updated last year
Alternatives and similar repositories for qwen-dpo
Users that are interested in qwen-dpo are comparing it to the libraries listed below
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 2 months ago
- LLM Tokenizer with BPE algorithm☆47May 7, 2024Updated last year
- ☆16Apr 1, 2025Updated 10 months ago
- General-purpose simple utilities project☆22Oct 6, 2024Updated last year
- A legal LLM fine-tuned from the Qwen2.5 model on the DISC-Law-SFT-Pair dataset☆10Dec 29, 2024Updated last year
- 🚀 Fine-tune Large Language Models on AWS SageMaker using LLaMA Factory - End-to-end pipeline for distributed LLM training, evaluation & …☆18Dec 5, 2024Updated last year
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
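- An irritable AI that unexpectedly tells the truth after DPO fine-tuning of Qwen-2.5-1.5B☆69Jan 18, 2025Updated last year
- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by the Hehe (合合) recognition service as initial training data, so that the model better fits a specific 10,000-image scenario and recognizes text more accurately.☆10Dec 9, 2024Updated last year
- Fine-tunes the Qwen1.5-0.5B-Chat model for general information extraction, aiming to: evaluate generative methods against extractive NER; give beginners a simple fine-tuning workflow with minimal code; and handle data formatting for LLM training.☆15Sep 6, 2024Updated last year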
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- Using the Vanna framework and a custom API, with complete call examples☆20Jul 17, 2024Updated last year
- ☆11Updated this week
- Simple decoder-only GPT model in PyTorch☆43May 19, 2024Updated last year
- ☆59Mar 8, 2025Updated 11 months ago
- RAG vector retrieval example☆149Feb 14, 2024Updated 2 years ago
- Achieve your exclusive DeepResearch.☆24Apr 25, 2025Updated 9 months ago
- Instruction fine-tuning of Qwen-1.5-7b with ModelScope + Transformers + SwanLab☆23Jun 9, 2024Updated last year
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆22Feb 8, 2025Updated last year
- qwen ai agent☆148Feb 21, 2024Updated last year
- PyTorch DDP Training Demo☆30Oct 20, 2024Updated last year
- ☆129Aug 8, 2024Updated last year
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆24Jul 19, 2024Updated last year
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year
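- SoulStar is a psychological counseling LLM whose persona is a gentle, caring big sister; it analyzes in detail the problems users confide, offers practical advice and comfort, and replies with cute emoji and kaomoji~~(*╹▽╹*)☆32Mar 3, 2024Updated last year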
- ☆139Sep 29, 2024Updated last year
- Fine-tuning qwen2.5-7b with Llamaindex☆35Dec 23, 2024Updated last year
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 7 months ago
- Fine-Tuning LLM and embedding models☆27Sep 12, 2023Updated 2 years ago
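- A repository for getting started quickly with transformers, covering the architecture and principles of related NLP and ViT models, the basic training steps, and fine-tuning methods, with accompanying hands-on code projects for fast learning☆34Mar 25, 2025Updated 10 months ago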
- ☆42Mar 6, 2025Updated 11 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Jun 27, 2025Updated 7 months ago
- A Complete Introduction to Building Generative AI Apps with Dify☆17May 25, 2025Updated 8 months ago
- Chinese-language tutorial on LLM agents; code repository for the blog☆58Nov 5, 2025Updated 3 months ago
- ☆77Nov 13, 2023Updated 2 years ago
- New word discovery / new word mining / degree of freedom / cohesion / python3☆10May 28, 2019Updated 6 years ago
- ☆28Dec 4, 2025Updated 2 months ago
- 100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…☆35Oct 22, 2025Updated 3 months ago
- Workflow automation, but you just describe what you want and it happens.☆26Nov 22, 2025Updated 2 months ago