大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调
☆632May 26, 2025Updated 11 months ago
Alternatives and similar repositories for LLM-Finetune
Users that are interested in LLM-Finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 一个用于预防经济诈骗的文本分类检测微调项目。☆88Feb 4, 2025Updated last year
- Qwen3 Fine-tuning: Medical R1 Style Chat☆311May 31, 2025Updated 11 months ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆219Oct 4, 2024Updated last year
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆30,252Apr 24, 2026Updated last week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆13,977Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ModelScope+Transformers+SwanLab实现Qwen-1.5-7b的指令微调任务☆23Jun 9, 2024Updated last year
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- LLM for NER☆82Jul 29, 2024Updated last year
- 轻量级 LLM Post-training 框架,支持 SFT、RLVR、On-Policy KD、Guide KD 及混合训练;实现单轮/多轮 Guide 蒸馏、多教师蒸馏、Reward 混合训练与自动化数据分流👩🎓👨🎓☆932Mar 8, 2026Updated last month
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆70,969Updated this week
- [COLING 2025] Official Repo for Paper "Beyond Boundaries: Learning Universal Entity Taxonomy across Datasets and Languages for Open Named…☆28Feb 5, 2026Updated 3 months ago
- SwanLab Official Documentation | SwanLab官方文档☆24Apr 28, 2026Updated last week
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …☆3,896Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 一些 LLM 方面的从零复现笔记☆250Apr 29, 2025Updated last year
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆24,178Mar 12, 2026Updated last month
- 本项目旨在提供一个微调酒店推荐垂直领域大模型并应用的完整闭环案例作为大家的参考案例。本项目使用的基础大模型为Qwen2.5-7B-Instruct。项目特色:完整的垂直应用案例闭环、项目源码剖析开源共享、详实的图文指导手册、手把手全流程实操演示视频☆90Apr 23, 2025Updated last year
- ☆13Feb 17, 2025Updated last year
- 一个包含了多种主流大模型微调方案的实战代码库,基于Qwen3 系列模型☆129Aug 10, 2025Updated 8 months ago
- 阿里云天池 - GLM 法律行业大模型挑战赛 - 我们小组实现基于大模型的对话机器人源码☆17Oct 23, 2024Updated last year
- Smart LLM/Agent Management in One Line of Code☆21Mar 22, 2026Updated last month
- Train a 1B LLM with 1T tokens from scratch by personal☆800Apr 27, 2025Updated last year
- 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题☆14,079Apr 30, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 大模型学习--从模型部署到模型微调,此项目是经过训练营学习后,结合训练营项目,自我理解消化总结,以及创新型应用。可star/fork☆21Mar 26, 2024Updated 2 years ago
- 整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。☆22,563Apr 23, 2026Updated last week
- 使用BERT-BILSTM-CRF进行中文命名实体识别。☆491Jan 9, 2025Updated last year
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,645Oct 24, 2024Updated last year
- ☆11Nov 9, 2022Updated 3 years ago
- GraphRAG 中文文档。GraphRAG是一种结构化的、分层的检索增强生成(RAG)方法,而不是使用纯文本片段的语义搜索方法。GraphRAG 过程包括从原始文本中提取出知识图谱,构建社群层级(这种结构 通常用来描述个体、群体及它们之间的关系,帮助理解信息如何在社群内部传…☆19Jul 12, 2024Updated last year
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆7,078Jul 4, 2025Updated 10 months ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆33Feb 10, 2026Updated 2 months ago
- ☆30Feb 27, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama☆29Jun 26, 2023Updated 2 years ago
- ☆13Jun 5, 2024Updated last year
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,779Dec 12, 2023Updated 2 years ago
- 基于大语言模型的RAG项目,分别实现了基于文本和知识图谱的RAG☆29Dec 11, 2025Updated 4 months ago
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,799Feb 12, 2026Updated 2 months ago
- 爬取同花顺的股票(A股)信息☆10Nov 5, 2021Updated 4 years ago
- 复现大模型相关算法及一些学习记录☆3,324Mar 21, 2026Updated last month