通义千问的DPO训练
☆65Sep 21, 2024Updated last year
Alternatives and similar repositories for qwen-dpo
Users that are interested in qwen-dpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated last year
- RAG向量召回示例☆152Feb 14, 2024Updated 2 years ago
- ☆66Aug 23, 2024Updated last year
- 基于Qwen-2.5-1.5B 进行DPO fine-tuning后,意外说真话的AI暴躁哥☆70Jan 18, 2025Updated last year
- qwen ai agent☆150Feb 21, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- ☆135Aug 8, 2024Updated last year
- Achieve your exclusive DeepResearch.☆26Apr 25, 2025Updated 11 months ago
- simple decoder-only GTP model in pytorch☆44May 19, 2024Updated last year
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- ☆19Mar 6, 2024Updated 2 years ago
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆16May 27, 2025Updated 10 months ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆143Sep 29, 2024Updated last year
- ☆76Nov 13, 2023Updated 2 years ago
- Master UI for ROBOKOP☆15Mar 31, 2023Updated 3 years ago
- 收集整理大模型面试题☆12Aug 29, 2024Updated last year
- SoulStar 是一个心理咨询大模型,内核为温柔知心的大姐姐,能详细分析倾诉的问题,给出切实的建议和安慰,并有可爱表情和颜文字回复~~(*╹▽╹*)☆33Mar 3, 2024Updated 2 years ago
- We implement an efficient mechanism for compressing large networks by {\em tensorizing\/} network layers: i.e. mapping layers on to high-…☆11Jul 10, 2018Updated 7 years ago
- 一个用于快速入门transformer的仓库,梳理相关nlp和vit模型结构、原理,训练的基本步骤及微调方法, 配套能快速学习的代码实战项目☆35Mar 25, 2025Updated last year
- Native AI 是一个探索本地生活电商领域的多智能体系统,通过 AI 助手一站式解决用户吃喝玩乐住行等日常生活需求。系统基于大语言模型技术,主要为了探索Multi Agent的应用。☆12Apr 13, 2025Updated last year
- 使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。☆14Sep 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple code for the tutorial on Polynomial Nets.☆13Jan 19, 2023Updated 3 years ago
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 9 months ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- 基于BERT+Biaffine结构的关系抽取模型☆12Feb 23, 2022Updated 4 years ago
- ☆15Nov 10, 2023Updated 2 years ago
- GraphRAG 中文文档。GraphRAG是一种结构化的、分层的检索增强生成(RAG)方法,而不是使用纯文本片段的语义搜索方法。GraphRAG 过程包括从原始文本中提取出知识图谱,构建社群层级(这种结构通常用来描述个体、群体及它们之间的关系,帮助理解信息如何在社群内部传…☆19Jul 12, 2024Updated last year
- Parallel_Computer_Architecture经典书籍☆17May 13, 2022Updated 3 years ago
- An AI project to provide `private` chat and RAG service. 一个提供私有化检索增强生成的AI项目☆11Jul 14, 2024Updated last year
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Team UWA's visualisation app developed as part of the ICDM 2019 Knowledge Graph Contest.☆13Dec 8, 2022Updated 3 years ago
- Neural Processing Letters: End-to-End Entity Detection with Proposer and Regressor☆12Jun 6, 2023Updated 2 years ago
- MNIST experiment from Tensorizing neural networks (Novikov et al. 2015)☆14Oct 22, 2019Updated 6 years ago
- pytorch复现transformer☆92Jan 18, 2024Updated 2 years ago
- Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取☆15Nov 16, 2019Updated 6 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- 基于retinaface人脸检测和facenet人脸识别的Flask服务☆14Mar 6, 2023Updated 3 years ago