DPO training for Tongyi Qianwen (Qwen)
☆65Sep 21, 2024Updated last year
Alternatives and similar repositories for qwen-dpo
Users that are interested in qwen-dpo are comparing it to the libraries listed below.
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- RAG vector retrieval (recall) example☆154Feb 14, 2024Updated 2 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- ☆68Aug 23, 2024Updated last year
- A hot-tempered AI that unexpectedly tells the truth, produced by DPO fine-tuning of Qwen-2.5-1.5B☆72Jan 18, 2025Updated last year
- qwen ai agent☆151Feb 21, 2024Updated 2 years ago
- ☆140Aug 8, 2024Updated last year
- Simple decoder-only GPT model in PyTorch☆44May 19, 2024Updated 2 years ago
- General-purpose simple tools project☆22Oct 6, 2024Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Apr 13, 2026Updated 2 months ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- Tongyi Qianwen (Qwen) SFT experiments☆83Jan 6, 2024Updated 2 years ago
- (NBCE) Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 3 years ago
- This software uses a config file (config.py), which is a settings file, to build and run SWAT+ models. Users can share the config along w…☆16Mar 9, 2022Updated 4 years ago
- ☆145Sep 29, 2024Updated last year
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- ☆76Nov 13, 2023Updated 2 years ago
- A curated collection of large language model interview questions☆12Aug 29, 2024Updated last year
- vision transformer on mnist dataset☆49Mar 24, 2024Updated 2 years ago
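- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by Hehe (合合) recognition as the initial training data, so that the model better fits the specific scenario of 10,000 images and achieves higher text recognition accuracy.☆10Dec 9, 2024Updated last year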
- DRL for WebRTC Control☆12Feb 3, 2024Updated 2 years ago
- We implement an efficient mechanism for compressing large networks by tensorizing network layers: i.e. mapping layers on to high-…☆11Jul 10, 2018Updated 7 years ago
- A Python tool that generates LaTeX code (e.g. tables, matrices).☆10Jun 22, 2022Updated 3 years ago
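- Native AI is a multi-agent system that explores the local-life e-commerce domain, using an AI assistant as a one-stop solution for users' everyday needs such as dining, entertainment, lodging, and travel. The system is built on large language model technology and mainly serves to explore applications of Multi Agent.☆11Apr 13, 2025Updated last year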
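- Fine-tunes the Qwen1.5-0.5B-Chat model for general information extraction, aiming to: verify how the generative approach performs compared with extractive NER; give beginners a simple model fine-tuning workflow with as little code as possible; handle data formatting for large-model training.☆14Sep 6, 2024Updated last year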
- Simple code for the tutorial on Polynomial Nets.☆13Jan 19, 2023Updated 3 years ago
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 11 months ago
- Chinese error correction using a pinyin tree and edit distance☆13Jul 19, 2019Updated 6 years ago
- PyTorch DDP Training Demo☆31Oct 20, 2024Updated last year
- ☆16Nov 10, 2023Updated 2 years ago
- Official repository for AAAI'23 paper: Let Graph be the Go Board: Gradient-free Node Injection Attack for Graph Neural Networks via Reinf…☆31Nov 26, 2022Updated 3 years ago
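- Chinese documentation for GraphRAG. GraphRAG is a structured, hierarchical approach to retrieval-augmented generation (RAG), as opposed to semantic search over plain-text snippets. The GraphRAG process includes extracting a knowledge graph from raw text, building a community hierarchy (a structure typically used to describe individuals, groups, and the relationships among them, helping to understand how information within a community…☆19Jul 12, 2024Updated last year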
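- This project simulates the full workflow of building and querying a private health-record knowledge base, implementing RAG (retrieval-augmented generation) that supports multiple large models (such as OpenAI and Alibaba's Tongyi Qianwen) from a single codebase: (1) offline steps: document loading -> document splitting -> vectorization -> loading into the vector database; online steps: get the user question -> vectorize the user question -> query the vector database…☆293Sep 6, 2024Updated last year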
- LidarPointCloudReconstruction☆13May 26, 2024Updated 2 years ago
- The code of SKS☆15Mar 22, 2022Updated 4 years ago
- Team UWA's visualisation app developed as part of the ICDM 2019 Knowledge Graph Contest.☆12Dec 8, 2022Updated 3 years ago
- ☆17Jul 6, 2023Updated 2 years ago
- Transformer reimplemented in PyTorch☆93Jan 18, 2024Updated 2 years ago
- ☆13Mar 18, 2026Updated 3 months ago