Zeyi-Lin/Qwen3-Medical-SFT

Qwen3 Fine-tuning: Medical R1 Style Chat

☆332

Alternatives and similar repositories for Qwen3-Medical-SFT

Users who are interested in Qwen3-Medical-SFT are comparing it to the repositories listed below.

Zeyi-Lin / LLM-Finetune
Large language model fine-tuning: instruction fine-tuning for Qwen2VL, Qwen2, and GLM4
☆653 · May 26, 2025 · Updated last year
lijiayi-ai / Qwen3-FineTuning-Playground
A hands-on codebase covering a range of mainstream LLM fine-tuning approaches, based on the Qwen3 model series
☆134 · Aug 10, 2025 · Updated 11 months ago
Guldfisk5682 / TinyLLaVA-Qwen3
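A low-cost, easy-to-start learning project for multimodal large models. Built on Qwen3-0.6B and CLIP, using the LLaVA architecture and LoRA fine-tuning; training completes in a few hours on a consumer-grade 16 GB GPU
☆51 · Sep 15, 2025 · Updated 10 months ago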
SwanHubX / glm4-finetune
Introduction to ChatGLM4 fine-tuning
☆27 · Apr 8, 2025 · Updated last year
owenliang / qwen2.5-0.5b-grpo
Qwen2.5 0.5B GRPO
☆86 · Feb 16, 2025 · Updated last year
TeenLucifer / grpo_reproduce
A comparison of DeepSeek GRPO and Qwen GSPO for fine-tuning Qwen2.5-1.5B-Instruct.
☆171 · Mar 28, 2026 · Updated 4 months ago
qiufengqijun / mini_qwen
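A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization (DPO). The model has 1B parameters and supports Chinese and English.
☆862 · Feb 18, 2025 · Updated last year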
wyf3 / llm_related
Reproductions of LLM-related algorithms, plus some study notes
☆3,471 · Jul 2, 2026 · Updated 3 weeks ago
taishan1994 / pytorch_unbalanced_text_classification
PyTorch-based text classification for imbalanced data
☆12 · Dec 26, 2021 · Updated 4 years ago
Dylan9897 / LLM-TextClassification
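Integrates advanced large language models such as Qwen and DeepSeek, supporting both a pure LLM + classification layer mode and an LLM + LoRA + classification layer mode. Design and training are modular and built on transformers, so components can be adjusted or replaced as needed.
☆21 · Sep 1, 2025 · Updated 10 months ago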
FedDG23 / FedDG-main
☆10 · Nov 21, 2023 · Updated 2 years ago
shibing624 / MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large models, implementing continued pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
☆5,670 · Jun 3, 2026 · Updated last month
datawhalechina / self-llm
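"A Guide to Consuming Open-Source LLMs": tutorials tailor-made for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
☆31,479 · Jul 15, 2026 · Updated 2 weeks ago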
yy444 / GflowOpt
☆13 · Aug 19, 2025 · Updated 11 months ago
zenghy96 / Reliable-Source-Approximation
Reliable Source Approximation: Source-Free Domain Adaptation for Vestibular Schwannoma MRI Segmentation
☆11 · Dec 28, 2024 · Updated last year
KMnO4-zx / blog
All of my blog posts are stored in this project's issues
☆21 · Sep 12, 2025 · Updated 10 months ago
DJofOUC / tensorflow_serving_docker_deploy
Deploy machine learning models with TensorFlow Serving and Docker
☆10 · Dec 5, 2018 · Updated 7 years ago
NJUxlj / Travel-Agent-based-on-Qwen2-RLHF
A travel agent based on Qwen2.5, fine-tuned with SFT + DPO/PPO/GRPO on a travel question-answer dataset; a mind map can be output using …
☆81 · Jul 6, 2026 · Updated 3 weeks ago
modelscope / ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,995 · Updated this week
WLS04 / EOPD
☆20 · May 17, 2026 · Updated 2 months ago
hiyouga / LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆73,606 · Updated this week
tensorinfinitysip / a-PyTorch-Project-to-Image-Caption
Image Caption with Attention | a PyTorch Project to Image Caption
☆17 · Jul 14, 2019 · Updated 7 years ago
jingyaogong / minimind-v
👀 Train a 65M-parameter vision-language model (VLM) from scratch in just 2 hours!
☆8,396 · Jun 28, 2026 · Updated last month
xxm1668 / ChatGLM-Efficient-LORA
Based on the ChatGLM large model and fine-tuned with LoRA; a project that integrates the SFT, RM, and PPO algorithms
☆15 · Jun 20, 2023 · Updated 3 years ago
winter1203 / vllm_GOT2_OCR
Accelerating GOT-OCRv2 with VLLM
☆10 · Nov 15, 2024 · Updated last year
828Tina / deepseek-llm-7B-chat-lora-ft
A tutorial on LoRA fine-tuning of DeepSeek using a multi-turn dialogue dataset
☆61 · Dec 26, 2024 · Updated last year
datawhalechina / tiny-universe
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
☆4,987 · Feb 12, 2026 · Updated 5 months ago
liguodongiot / llm-action
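This project shares the technical principles behind large models along with practical experience (LLM engineering and putting LLM applications into production)
☆24,819 · Jul 19, 2026 · Updated last week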
lemon-little / BetterSynth
Tianchi Better Synth multimodal large model data synthesis challenge: a solution that counts beating the baseline as success
☆30 · Oct 30, 2025 · Updated 9 months ago
SwanHubX / SwanLab
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …
☆4,098 · Updated this week
seohyunwoo-0407 / GAR
FinanceRAG project by KAIST students. Advanced Retrieval-Augmented Generation (RAG) system designed for the financial domain.
☆17 · Feb 11, 2025 · Updated last year
Trae1ounG / 2024-BaiduAI-LLM-DSI
2024 Baidu Commercial AI Technology Innovation Competition, Track 1: national first-prize solution for LLM-based ad retrieval
☆19 · Feb 23, 2025 · Updated last year
yongzhuo / qwen2-sft
Qwen1.5-SFT (Alibaba): Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers), LoRA (peft), and inference
☆73 · May 17, 2024 · Updated 2 years ago
albert-jeffery / DeepSpeed-Finetuning
A DeepSpeed-based LLM fine-tuning tutorial that explains in detail how to use DeepSpeed for fine-tuning and distributed training of a text summarization model
☆17 · May 6, 2026 · Updated 2 months ago
KMnO4-zx / tiny-llm
☆34 · Jul 8, 2025 · Updated last year
Purdue-M2 / MedChat
☆18 · Jun 8, 2025 · Updated last year
JZPeterPan / MedVLM-R1
☆32 · Sep 17, 2025 · Updated 10 months ago
NJUxlj / Chinese-MedQA-Qwen2
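A medical question-answering system based on Qwen2 + SFT + DPO. The project uses custom SFTTrainer/DPOTrainer/TRPOTrainer classes for training and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, vllm is used for inference…
☆89 · Apr 29, 2026 · Updated 3 months ago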
LuisEstebanAcevedoBringas / FCOS_torch
☆10 · Jul 18, 2022 · Updated 4 years ago