Qwen3 Fine-tuning: Medical R1 Style Chat
☆305May 31, 2025Updated 10 months ago
Alternatives and similar repositories for Qwen3-Medical-SFT
Users that are interested in Qwen3-Medical-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调☆627May 26, 2025Updated 10 months ago
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Updated this week
- ☆10Apr 30, 2025Updated 11 months ago
- 一个包含了多种主流大模型微调方案的实战代码库,基于Qwen3系列模型☆126Aug 10, 2025Updated 8 months ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 2024百度商业AI技术创新大赛赛道一:基于大模型的广告检索全国一等奖获奖方案☆17Feb 23, 2025Updated last year
- Tensorflow 1.x solution for chinese NER task, using ALBERT-LSTM-CRF model☆18Apr 19, 2020Updated 5 years ago
- Smart LLM/Agent Management in One Line of Code☆21Mar 22, 2026Updated 3 weeks ago
- 天池Better Synth多模态大模型数据合成挑战赛-打赢baseline就算成功方案☆28Oct 30, 2025Updated 5 months ago
- LibreOJ Problem download tool☆21Oct 6, 2024Updated last year
- A free program with a user-friendly interface that allows you to download Office 365, 2024, 2021, 2019, 2016 as well as Project and Visio☆30Sep 29, 2025Updated 6 months ago
- TensorFlow code and pre-trained models for BERT☆12Mar 19, 2019Updated 7 years ago
- SwanLab Official Documentation | SwanLab官方文档☆23Updated this week
- ☆11Oct 31, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- An pytorch implementation of MatchPyramid "Text Matching as Image Recognition"☆11Jul 25, 2024Updated last year
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 11 months ago
- GraphRAG 的中文优化版本☆23Dec 19, 2025Updated 3 months ago
- Psy-Insight: Mental Health Oriented Interpretable Multi-turn Bilingual Counseling Dataset for Large Language Model Finetuning☆22Jan 4, 2026Updated 3 months ago
- Finetuned Mistral that suggests Movies!☆11Jan 4, 2024Updated 2 years ago
- ☆14Apr 19, 2024Updated last year
- 基于pytorch + bert的多标签文本分类(multi label text classification)☆108Jul 18, 2023Updated 2 years ago
- 基于OpenCV的刷脸考勤&人脸校验&用户管理系统(源码&教程)☆11Nov 25, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆73Jan 4, 2026Updated 3 months ago
- 本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性…☆43Mar 10, 2025Updated last year
- Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25-SEA)☆16Aug 11, 2025Updated 8 months ago
- ☆11May 9, 2023Updated 2 years ago
- ☆13Feb 3, 2022Updated 4 years ago
- This announcement is used in the ATMHUFK's video. The original is from the another up,Which is called 原无奇变in Chinese.You can use it to av…☆10Jan 26, 2025Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, …☆13,631Updated this week
- ☆23Oct 14, 2024Updated last year
- A comparison of deepseek grpo and qwen gspo on Qwen2.5-1.5B-Instruct fine tunning.☆163Mar 28, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆21Mar 6, 2026Updated last month
- ☆35Jul 27, 2025Updated 8 months ago
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆29,737Updated this week
- A segmentation project based on aniseg, trained on yolov8-seg☆13Jul 15, 2023Updated 2 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- ☆16Nov 11, 2025Updated 5 months ago
- ☆85Sep 25, 2025Updated 6 months ago