yongzhuo/Qwen-SFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yongzhuo/Qwen-SFT)

yongzhuo / Qwen-SFT

阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理

☆142

Alternatives and similar repositories for Qwen-SFT

Users that are interested in Qwen-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

owenliang / qwen-sft
View on GitHub
通义千问 SFT试验
☆83Jan 6, 2024Updated 2 years ago
minghaochen / chatglm3-base-tuning
View on GitHub
chatglm3base模型的有监督微调SFT
☆80Nov 5, 2023Updated 2 years ago
Mingrui-Li / Qwen-VL-Lora-Model
View on GitHub
可以成功Lora微调的Qwen-VL模型
☆16Oct 27, 2023Updated 2 years ago
circle-hit / SAPT
View on GitHub
Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …
☆40Jan 13, 2025Updated last year
Focusshang / Tutorial
View on GitHub
☆14Apr 19, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gbup-group / EAN-efficient-attention-network
View on GitHub
The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.
☆20Jun 16, 2023Updated 2 years ago
HLTCHKUST / MulQG
View on GitHub
Multi-hop Question Generation with Graph Convolutional Network
☆30Nov 2, 2022Updated 3 years ago
meowcao / InsuranceModel
View on GitHub
基于internlm-chat-7b的保险知识大模型微调
☆20Apr 26, 2024Updated 2 years ago
LutingWang / HEAD
View on GitHub
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10Jul 3, 2023Updated 2 years ago
yongzhuo / qwen2-sft
View on GitHub
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆73May 17, 2024Updated 2 years ago
XD3an / python-sequential-thinking-mcp
View on GitHub
A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…
☆24Jun 1, 2025Updated 11 months ago
dhg-wei / MCL
View on GitHub
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28Sep 27, 2024Updated last year
yongzhuo / LLM-SFT
View on GitHub
中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…
☆215May 17, 2024Updated 2 years ago
OpenLMLab / ParallelTokenizer
View on GitHub
Use the tokenizer in parallel to achieve superior acceleration
☆20Mar 21, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ArtificialZeng / Qwen-Tuning
View on GitHub
Qwen-Efficient-Tuning
☆44Aug 16, 2023Updated 2 years ago
TGLTommy / LangChain_LLM_ChatBot
View on GitHub
基于LLM和LangChain实现基于本地文档的QA chatbot
☆35Aug 13, 2023Updated 2 years ago
CV-xueba / A05_rl
View on GitHub
本课程主要介绍强化学习的基础知识，其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程，动态规划，无模型预测与控制(SASA,Q-Learning)，价值函数逼近(DQN)，策略梯度方法(REINFORCE)，执行者/评论者…
☆17Oct 17, 2022Updated 3 years ago
Arvid-pku / ATOKE
View on GitHub
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆14Dec 17, 2023Updated 2 years ago
iLearn-Lab / ICML24-RoboMP2
View on GitHub
[ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…
☆11Apr 4, 2026Updated last month
MDI-Benchmark / MDI-Benchmark
View on GitHub
☆14Dec 18, 2024Updated last year
myt517 / DKT
View on GitHub
Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…
☆14Jul 23, 2022Updated 3 years ago
shadowkiller33 / Language_attack
View on GitHub
A repo for LLM jailbreak
☆14Sep 5, 2023Updated 2 years ago
squest / zenx-integrated-learning
View on GitHub
Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell
☆19Oct 16, 2015Updated 10 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
vivian1993 / NCRL
View on GitHub
The official Pytorch implementation of the paper Neural Compositional Rule Learning for Knowledge Graph Reasoning
☆36Jul 7, 2023Updated 2 years ago
JiachengLi1995 / JointIE
View on GitHub
A span-based joint named entity recognition (NER) and relation extraction model.
☆11Aug 5, 2020Updated 5 years ago
owenliang / asyncio-threadpool-demo
View on GitHub
fastapi异步IO+threadpool线程池的工作原理
☆18Feb 12, 2024Updated 2 years ago
yuanyehome / PALT
View on GitHub
This is the source code of our paper PALT in EMNLP2022.
☆13Nov 19, 2022Updated 3 years ago
JITTorch / jtorch
View on GitHub
☆13Feb 1, 2024Updated 2 years ago
hanabi7 / point_cloud_smplify
View on GitHub
smplify code for point cloud based HMR
☆10Jan 11, 2022Updated 4 years ago
Face-Human-Bench / face-human-bench
View on GitHub
☆13Sep 26, 2025Updated 8 months ago
hanqi-qi / Mirror
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
zycheiheihei / Transferable-Visual-Prompting
View on GitHub
[CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…
☆45Dec 20, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
modelscope / ms-swift
View on GitHub
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,218May 22, 2026Updated last week
MlWoo / sentence2pinyin
View on GitHub
tts fronted-end
☆11Dec 19, 2018Updated 7 years ago
Adlik / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆12Nov 14, 2025Updated 6 months ago
traveler-leon / smart-customer-service-system
View on GitHub
构建一个基于大模型的智能客服系统，可提供静态知识问答(静态数据)、动态知识问答（数据库），业务办理（api调用）等功能，同时系统具有自我学习能力。定期的反思可让系统变得更强大。
☆94Nov 5, 2025Updated 6 months ago
mynewstart / Tianchi-Multi-Task-Learning
View on GitHub
第一名克莱登大学二队方案分享
☆18Mar 5, 2021Updated 5 years ago
PPMESSAGE / mod_ppmessagespeechdetect
View on GitHub
Speech detect of freeSwitch. With standard ASR interface of freeSwitch and send voice data via ESL.
☆12Apr 8, 2018Updated 8 years ago
K024 / chatglm-q
View on GitHub
Another ChatGLM2 implementation for GPTQ quantization
☆55Oct 15, 2023Updated 2 years ago