hiyouga / LLaMA-FactoryLinks

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

☆54,711

Alternatives and similar repositories for LLaMA-Factory

Users that are interested in LLaMA-Factory are comparing it to the libraries listed below

Sorting:

modelscope / ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-…
☆8,834Updated this week
FlagOpen / FlagEmbedding
Retrieval and Retrieval-augmented LLMs
☆10,191Updated last week
QwenLM / Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
☆18,742Updated last month
LlamaFamily / Llama-Chinese
Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用
☆14,647Updated 3 months ago
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
☆42,597Updated this week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆52,682Updated this week
xorbitsai / inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…
☆8,250Updated this week
QwenLM / Qwen-Agent
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆10,241Updated last month
THUDM / GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
☆6,705Updated 3 weeks ago
microsoft / graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆26,742Updated this week
QwenLM / Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆22,774Updated this week
yangjianxin1 / Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…
☆6,489Updated 9 months ago
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,116Updated this week
HqWu-HITCS / Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
☆20,695Updated 2 months ago
open-compass / opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …
☆5,734Updated this week
QwenLM / Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆11,654Updated 2 months ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆14,675Updated this week
volcengine / verl
verl: Volcano Engine Reinforcement Learning for LLMs
☆11,391Updated this week
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆16,236Updated this week
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆23,134Updated 11 months ago
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆17,654Updated this week
ymcui / Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
☆18,890Updated last week
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆18,448Updated last week
chatchat-space / Langchain-Chatchat
Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…
☆35,673Updated 4 months ago
QwenLM / Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
☆6,114Updated 11 months ago
InternLM / lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
☆6,760Updated this week
ConardLi / easy-dataset
A powerful tool for creating fine-tuning datasets for LLM
☆9,625Updated last week
meta-llama / llama3
The official Meta Llama 3 GitHub site
☆28,857Updated 6 months ago
InternLM / InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
☆6,983Updated 5 months ago
InternLM / xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
☆4,654Updated 2 weeks ago