hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
☆34,589Updated this week
Related projects ⓘ
Alternatives and complementary repositories for LLaMA-Factory
- Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory☆18,263Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆16,471Updated this week
- Retrieval and Retrieval-augmented LLMs☆7,613Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆30,423Updated this week
- Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom dataset…☆15,222Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆20,286Updated 3 months ago
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆9,783Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,734Updated last week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆14,195Updated last week
- A series of large language models trained from scratch by developers @01-ai☆7,711Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆36,993Updated this week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆12,672Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,059Updated 5 months ago
- Train transformer language models with reinforcement learning.☆10,086Updated this week
- The Memory layer for your AI apps☆22,875Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,919Updated 6 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆12,427Updated last month
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆5,422Updated this week
- Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…☆32,102Updated this week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆33,080Updated this week
- Universal LLM Deployment Engine with ML Compilation☆19,215Updated this week
- Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用☆14,012Updated 2 months ago
- Official release of InternLM2.5 base and chat models. 1M context support☆6,482Updated this week
- Latest Advances on Multimodal Large Language Models☆12,722Updated this week
- Official inference library for Mistral models☆9,738Updated last week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆23,277Updated this week
- Inference code for CodeLlama models☆16,044Updated 3 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆19,247Updated this week
- LlamaIndex is a data framework for your LLM applications☆36,820Updated this week
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆6,852Updated 3 months ago