meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

☆15,222

Related projects ⓘ

Alternatives and complementary repositories for llama-recipes

Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆10,734Updated last week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆30,423Updated this week
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,059Updated 5 months ago
mistralai / mistral-inference
Official inference library for Mistral models
☆9,738Updated last week
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆20,286Updated 3 months ago
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆7,919Updated 6 months ago
meta-llama / llama
Inference code for Llama models
☆56,450Updated 3 months ago
unslothai / unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
☆18,263Updated this week
meta-llama / codellama
Inference code for CodeLlama models
☆16,044Updated 3 months ago
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆18,885Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆10,086Updated this week
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆9,122Updated this week
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆16,471Updated this week
chroma-core / chroma
the AI-native open-source embedding database
☆15,448Updated this week
ShishirPatil / gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆11,487Updated this week
pytorch / torchtune
PyTorch native finetuning library
☆4,336Updated this week
abetlen / llama-cpp-python
Python bindings for llama.cpp
☆8,141Updated this week
bentoml / OpenLLM
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
☆10,079Updated this week
nlpxucan / WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,269Updated 3 months ago
hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
☆34,589Updated this week
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆14,279Updated this week
run-llama / llama_index
LlamaIndex is a data framework for your LLM applications
☆36,820Updated this week
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆29,561Updated 4 months ago
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆5,994Updated 2 months ago
meta-llama / llama3
The official Meta Llama 3 GitHub site
☆27,145Updated 3 months ago
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆7,930Updated this week
dottxt-ai / outlines
Structured Text Generation
☆9,487Updated this week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆36,993Updated this week
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆10,776Updated 3 months ago
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,653Updated 3 months ago