DPO training for Tongyi Qianwen (Qwen)
☆64Sep 21, 2024Updated last year
Alternatives and similar repositories for qwen-dpo
Users who are interested in qwen-dpo are comparing it to the libraries listed below.
- LLM Tokenizer with BPE algorithm☆48May 7, 2024Updated last year
- RAG vector retrieval example☆151Feb 14, 2024Updated 2 years ago
- ☆66Aug 23, 2024Updated last year
- A hot-tempered AI that, after DPO fine-tuning on Qwen-2.5-1.5B, unexpectedly tells the truth☆70Jan 18, 2025Updated last year
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- ☆131Aug 8, 2024Updated last year
- Codebase used to generate the results for NeurIPS23 "Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directi…☆13Dec 8, 2023Updated 2 years ago
- Simple decoder-only GPT model in PyTorch☆43May 19, 2024Updated last year
- General-purpose simple tools project☆22Oct 6, 2024Updated last year
- ☆59Mar 8, 2025Updated last year
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- ☆143Sep 29, 2024Updated last year
- ☆76Nov 13, 2023Updated 2 years ago
- A curated collection of LLM interview questions☆12Aug 29, 2024Updated last year
- Empowering everyone to create reliable and safe AI coding agents.☆12Sep 2, 2024Updated last year
- MSTI☆16Mar 6, 2024Updated 2 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
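- SoulStar is a psychological counseling LLM with the persona of a gentle, caring big sister; it analyzes in detail the problems users confide, offers practical advice and comfort, and replies with cute emoticons and kaomoji~~(*╹▽╹*)☆32Mar 3, 2024Updated 2 years ago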
- DRL for WebRTC Control☆12Feb 3, 2024Updated 2 years ago
- ☆12Mar 6, 2023Updated 3 years ago
- We implement an efficient mechanism for compressing large networks by tensorizing network layers: i.e. mapping layers on to high-…☆11Jul 10, 2018Updated 7 years ago
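- Fine-tunes the Qwen1.5-0.5B-Chat model for general information extraction, aiming to: verify how generative methods perform compared with extractive NER; give beginners a simple model fine-tuning workflow with as little code as possible; and handle data formatting for LLM training.☆15Sep 6, 2024Updated last year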
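- Native AI is a multi-agent system exploring local-life e-commerce, using an AI assistant as a one-stop solution for users' everyday needs such as dining, entertainment, lodging, and travel. The system is built on large language model technology, mainly to explore applications of multi-agent systems.☆12Apr 13, 2025Updated 11 months ago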
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 8 months ago
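- New word discovery / new word mining / degree of freedom / cohesion / python3☆10May 28, 2019Updated 6 years ago
- Chinese spelling correction using a pinyin tree and edit distance☆13Jul 19, 2019Updated 6 years ago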
- PyTorch DDP Training Demo☆30Oct 20, 2024Updated last year
- ☆15Nov 10, 2023Updated 2 years ago
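- This project simulates the full workflow of building and querying a private health-record knowledge base, implementing RAG (retrieval-augmented generation) that supports multiple LLMs (such as OpenAI and Alibaba's Tongyi Qianwen) from a single codebase: (1) offline steps: document loading -> document splitting -> vectorization -> loading into a vector database; online steps: get the user question -> vectorize the user question -> query the vector database…☆254Sep 6, 2024Updated last year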
- Official repository for AAAI'23 paper: Let Graph be the Go Board: Gradient-free Node Injection Attack for Graph Neural Networks via Reinf…☆30Nov 26, 2022Updated 3 years ago
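- GraphRAG Chinese documentation. GraphRAG is a structured, hierarchical approach to retrieval-augmented generation (RAG), as opposed to semantic search over plain-text snippets. The GraphRAG process involves extracting a knowledge graph from raw text and building a community hierarchy (a structure commonly used to describe individuals, groups, and the relationships between them, helping to understand how information within a community…☆19Jul 12, 2024Updated last year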
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- An AI project to provide `private` chat and RAG service. An AI project offering privately deployed retrieval-augmented generation☆11Jul 14, 2024Updated last year
- The code of SKS☆15Mar 22, 2022Updated 4 years ago
- A learning platform that teaches you how to review☆17Oct 20, 2022Updated 3 years ago
- Repository for Interoperability of FATE☆12Dec 31, 2025Updated 2 months ago
- Fine-tuning of the Qwen1.5 LLM using LoRA via the PEFT framework, performing text classification on the HC3-Chinese dataset.☆12Jun 29, 2024Updated last year
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆11Aug 20, 2024Updated last year