hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆41,097Updated this week
Alternatives and similar repositories for LLaMA-Factory:
Users that are interested in LLaMA-Factory are comparing it to the libraries listed below
- A high-throughput and memory-efficient inference and serving engine for LLMs☆38,475Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆17,363Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆16,936Updated 2 weeks ago
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆30,529Updated this week
- Retrieval and Retrieval-augmented LLMs☆8,555Updated last week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,243Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,852Updated 3 weeks ago
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆15,511Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,480Updated 6 months ago
- Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用☆14,426Updated 5 months ago
- Train transformer language models with reinforcement learning.☆11,782Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆10,325Updated this week
- Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 15…☆5,674Updated this week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆7,059Updated last month
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆6,428Updated this week
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆7,189Updated 6 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,523Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆37,834Updated this week
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,353Updated 8 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆5,603Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆11,325Updated 2 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,470Updated 6 months ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,147Updated 3 months ago
- ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型☆13,625Updated last month
- DSPy: The framework for programming—not prompting—language models☆21,930Updated this week
- Fast and memory-efficient exact attention☆15,541Updated this week
- 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)☆18,705Updated 9 months ago
- Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…☆33,556Updated 2 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆18,531Updated this week
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆4,681Updated last week