阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理
☆139May 17, 2024Updated last year
Alternatives and similar repositories for Qwen-SFT
Users that are interested in Qwen-SFT are comparing it to the libraries listed below
Sorting:
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆35Jan 6, 2026Updated 2 months ago
- chatglm3base模型的有监督微调SFT☆79Nov 5, 2023Updated 2 years ago
- ☆20Dec 27, 2025Updated 2 months ago
- ☆11Feb 3, 2025Updated last year
- 纯c++的全平台llm加 速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆12Jan 30, 2026Updated last month
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆74May 17, 2024Updated last year
- HMM\CRF\BERT-CRF\BILSTM-CRF\BERTBILSTMCRF\XLNETBILSTMCRF☆33Jul 30, 2022Updated 3 years ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
- 微调阿里开源的文字检测模型,利用合合识别返回的OCR结果作为初始训练数据,对模型进行优化训练,使其更加适应1万张图片的具体场景,提高文字识别的精度。☆10Dec 9, 2024Updated last year
- A script for merging a LLM model and a LoRA☆13Jun 22, 2023Updated 2 years ago
- FastMCP for Google's langextract library☆28Aug 6, 2025Updated 7 months ago
- 针对qwen微调模型进行数据预处理☆13Jan 8, 2024Updated 2 years ago
- 大型中文道德句数据集CMOS☆10Apr 11, 2022Updated 3 years ago
- simple decoder-only GTP model in pytorch☆43May 19, 2024Updated last year
- 政策新闻领域 实体识别+关系抽取 基于4000句txt微调得到☆11Apr 9, 2024Updated last year
- ☆14Apr 19, 2024Updated last year
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Dec 2, 2025Updated 3 months ago
- 基于internlm-chat-7b的保险知识大模型微调☆20Apr 26, 2024Updated last year
- LibreOJ Problem download tool☆21Oct 6, 2024Updated last year
- A free program with a user-friendly interface that allows you to download Office 365, 2024, 2021, 2019, 2016 as well as Project and Visio☆27Sep 29, 2025Updated 5 months ago
- Synthetic data generation for TODs☆23Jul 17, 2024Updated last year
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆41Sep 23, 2024Updated last year
- ☆19Oct 9, 2024Updated last year
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆216May 17, 2024Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- 基于LLM和LangChain实现基于本地文档的QA chatbot☆35Aug 13, 2023Updated 2 years ago
- 本课程主要介绍强化学习的基础知识,其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程,动态规划,无模型预测与控制(SASA,Q-Learning),价值函数逼近(DQN),策略梯度方法(REINFORCE),执行者/评论者…☆18Oct 17, 2022Updated 3 years ago
- 基于电商数据微调的Qwen2.5系列的电商大模型,电商数据sft后电商大模型。是https://github.com/leeguandong/EcommerceLLM的升级版本。qwen2.5的效果很好。☆13Oct 4, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…☆14Jul 23, 2022Updated 3 years ago
- ☆58Updated this week
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- 利用简单的代码完成deepseek基于medical-o1-sft数据集的lora微调☆15Feb 25, 2025Updated last year
- The official Pytorch implementation of the paper Neural Compositional Rule Learning for Knowledge Graph Reasoning☆36Jul 7, 2023Updated 2 years ago
- Graph QABot Demo| 图谱问答案例☆15Apr 11, 2023Updated 2 years ago
- 基于opentype.js的手写字生成程序☆13Jan 29, 2023Updated 3 years ago