Large language model fine-tuning: instruction fine-tuning for Qwen2VL, Qwen2, and GLM4
☆617 · May 26, 2025 · Updated 9 months ago
Alternatives and similar repositories for LLM-Finetune
Users who are interested in LLM-Finetune are comparing it to the libraries listed below.
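- Tianchi Better Synth multimodal large model data synthesis challenge: a "beating the baseline counts as success" solution ☆27 · Oct 30, 2025 · Updated 4 months ago
- Creating Your Divine Agent 😇 ☆10 · Jan 26, 2026 · Updated 2 months ago
- Qwen3 Fine-tuning: Medical R1 Style Chat ☆297 · May 31, 2025 · Updated 9 months ago
- LLM-based named entity recognition and entity relation extraction ☆17 · Jan 4, 2024 · Updated 2 years ago
- Full-parameter, LoRA, and QLoRA fine-tuning of Llama3. ☆217 · Oct 4, 2024 · Updated last year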
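- "A Guide to Consuming Open-Source Large Models" (《开源大模型食用指南》): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment ☆29,261 · Updated this week
- Instruction fine-tuning of Qwen-1.5-7b with ModelScope + Transformers + SwanLab ☆23 · Jun 9, 2024 · Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, … ☆13,263 · Updated this week
- Qwen1.5 fine-tuning: LoRA fine-tuning with the PEFT framework for text classification on the HC3-Chinese dataset. ☆12 · Jun 29, 2024 · Updated last year
- An introduction to ChatGLM4 fine-tuning ☆22 · Apr 8, 2025 · Updated 11 months ago
- LLM for NER ☆81 · Jul 29, 2024 · Updated last year
- A lightweight LLM post-training framework supporting SFT, RLVR, On-Policy KD, Guide KD, and hybrid training; implements single-turn/multi-turn Guide distillation, multi-teacher distillation, mixed Reward training, and automated data routing 👩‍🎓👨‍🎓 ☆931 · Mar 8, 2026 · Updated 2 weeks ago
- Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat ☆73 · May 17, 2024 · Updated last year
- Demo of Finetuning Multimodal LLM with LLaMA-Factory ☆57 · Sep 8, 2024 · Updated last year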
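- [COLING 2025] Official Repo for Paper "Beyond Boundaries: Learning Universal Entity Taxonomy across Datasets and Languages for Open Named… ☆27 · Feb 5, 2026 · Updated last month
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆68,728 · Mar 18, 2026 · Updated last week
- SwanLab Official Documentation ☆23 · Mar 19, 2026 · Updated last week
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with … ☆3,718 · Updated this week
- Notes on reproducing LLM techniques from scratch ☆246 · Apr 29, 2025 · Updated 10 months ago
- This project shares the technical principles of large models and hands-on experience (LLM engineering and putting LLM applications into production) ☆23,641 · Mar 12, 2026 · Updated last week
- This project provides a complete end-to-end reference case for fine-tuning and applying a vertical-domain large model for hotel recommendation. The base model is Qwen2.5-7B-Instruct. Highlights: a complete closed-loop vertical application case, open-source code with walkthroughs, a detailed illustrated guide, and step-by-step demo videos covering the full workflow ☆88 · Apr 23, 2025 · Updated 11 months ago
- ☆13 · Feb 17, 2025 · Updated last year
- A hands-on code repository covering several mainstream LLM fine-tuning approaches, based on the Qwen3 model series ☆124 · Aug 10, 2025 · Updated 7 months ago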
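- Alibaba Cloud Tianchi - GLM Legal Industry Large Model Challenge - source code of our team's LLM-based chatbot ☆17 · Oct 23, 2024 · Updated last year
- Smart LLM/Agent Management in One Line of Code ☆21 · Mar 4, 2026 · Updated 3 weeks ago
- Train a 1B LLM on 1T tokens from scratch as an individual ☆792 · Apr 27, 2025 · Updated 10 months ago
- Primarily documents knowledge and interview questions for large language model (LLM) algorithm (application) engineers ☆13,487 · Apr 30, 2025 · Updated 10 months ago
- Learning large models, from model deployment to model fine-tuning. Created after a training camp, this project combines the camp's projects with the author's own understanding, summaries, and novel applications. Feel free to star/fork ☆21 · Mar 26, 2024 · Updated 2 years ago
- A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, covering base models, vertical-domain fine-tuning and applications, datasets, tutorials, and more. ☆22,469 · May 19, 2025 · Updated 10 months ago
- Chinese named entity recognition with BERT-BILSTM-CRF. ☆489 · Jan 9, 2025 · Updated last year
- Firefly: a large model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,652 · Oct 24, 2024 · Updated last year
- ☆11 · Nov 9, 2022 · Updated 3 years ago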
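- GraphRAG Chinese documentation. GraphRAG is a structured, hierarchical approach to retrieval-augmented generation (RAG), as opposed to semantic search over plain-text snippets. The GraphRAG process involves extracting a knowledge graph from raw text and building a community hierarchy (a structure typically used to describe individuals, groups, and the relationships between them, helping explain how information within a community… ☆19 · Jul 12, 2024 · Updated last year
- GLM-4 series: Open Multilingual Multimodal Chat LMs ☆7,075 · Jul 4, 2025 · Updated 8 months ago
- 2025.01: a multimodal large model built from scratch and named Reyes (睿视; R for 睿, "wise", and eyes for 眼, "eye"). Reyes has 8B parameters, uses InternViT-300M-448px-V2_5 as the vision encoder and Qwen2.5-7B-Instruct on the language model side, and Reyes also uses a two… ☆33 · Feb 10, 2026 · Updated last month
- ☆29 · Feb 27, 2025 · Updated last year
- An LLM fine-tuning project covering QLoRA fine-tuning of ChatGLM and LLaMA ☆29 · Jun 26, 2023 · Updated 2 years ago
- ☆13 · Jun 5, 2024 · Updated last year
- Fine-tuning for specific downstream tasks based on the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more ☆2,782 · Dec 12, 2023 · Updated 2 years ago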