通义千问的DPO训练
☆65Sep 21, 2024Updated last year
Alternatives and similar repositories for qwen-dpo
Users that are interested in qwen-dpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- 基于Qwen-2.5-1.5B 进行DPO fine-tuning后,意外说真话的AI暴躁哥☆72Jan 18, 2025Updated last year
- qwen ai agent☆151Feb 21, 2024Updated 2 years ago
- ☆102Nov 5, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆138Aug 8, 2024Updated last year
- Achieve your exclusive DeepResearch.☆26Apr 25, 2025Updated last year
- simple decoder-only GTP model in pytorch☆44May 19, 2024Updated last year
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- ☆60Mar 8, 2025Updated last year
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆15May 27, 2025Updated 11 months ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- 通义千问 SFT试验☆83Jan 6, 2024Updated 2 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 零实现 AlphaGo Zero☆17Nov 10, 2024Updated last year
- ☆145Sep 29, 2024Updated last year
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- ☆76Nov 13, 2023Updated 2 years ago
- SoulStar 是一个心理咨询大模型,内核为温柔知心的大姐姐,能详细分析倾诉的问题,给出切实的建议和安慰,并有可爱表情和颜文字回复~~(*╹▽╹*)☆33Mar 3, 2024Updated 2 years ago
- ☆12Mar 6, 2023Updated 3 years ago
- ☆163Jun 25, 2025Updated 10 months ago
- 使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。☆14Sep 6, 2024Updated last year
- Native AI 是一个探索本地生活电商领域的多智能体系统,通过 AI 助手一站式解决用户吃喝玩乐住行等日常 生活需求。系统基于大语言模型技术,主要为了探索Multi Agent的应用。☆12Apr 13, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 9 months ago
- 基于BERT+Biaffine结构的关系抽取模型☆12Feb 23, 2022Updated 4 years ago
- 中文纠错-使用拼音树及编辑距离☆13Jul 19, 2019Updated 6 years ago
- 《自然语言处理:大模型理论与实践》配套数据和代码☆77Dec 24, 2025Updated 4 months ago
- ☆15Nov 10, 2023Updated 2 years ago
- Official repository for AAAI'23 paper: Let Graph be the Go Board: Gradient-free Node Injection Attack for Graph Neural Networks via Reinf…☆30Nov 26, 2022Updated 3 years ago
- 中华药典RAG项目☆10Oct 26, 2024Updated last year
- GraphRAG 中文文档。GraphRAG是一种结构化的、分层的检索增强生成(RAG)方法,而不是使用纯文本片段的语义搜索方法。GraphRAG 过程包括从原始文本中提取出知识图谱,构建社群层级(这种结构通常用来描述个体、群体及它们之间的关系,帮助理解信息如何在社群内部传…☆19Jul 12, 2024Updated last year
- 使用numpy从零开始实现llama3的推理流程,并对其进行封装,对比GPU,CPU上的表现以及Lora微调。llama3 implemented from scratch using numpy and lora fine-tune.。☆12Jul 16, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 在本项目中模拟健康档案私有知识库构建和检索全流程,通过一份代码实现了同时支持多种大模型(如OpenAI、阿里通义千问等)的RAG(检索增强生成)功能:(1)离线步骤:文档加载->文档切分->向量化->灌入向量数据库;在线步骤:获取用户问题->用户问题向量化->检索向量数据库…☆266Sep 6, 2024Updated last year
- An AI project to provide `private` chat and RAG service. 一个提供私有化检索增强生成的AI项目☆11Jul 14, 2024Updated last year
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 5 months ago
- chinese few-shot ner☆16Aug 28, 2022Updated 3 years ago
- Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取☆15Nov 16, 2019Updated 6 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- Repository for Interoperability of FATE☆12Dec 31, 2025Updated 4 months ago