meta-llama / llama-cookbookLinks
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
β17,767Updated last week
Alternatives and similar repositories for llama-cookbook
Users that are interested in llama-cookbook are comparing it to the libraries listed below
Sorting:
- Train transformer language models with reinforcement learning.β15,259Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β19,390Updated this week
- The official Meta Llama 3 GitHub siteβ28,931Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ56,349Updated this week
- Go ahead and axolotl questionsβ10,289Updated this week
- Inference code for Llama modelsβ58,668Updated 7 months ago
- A framework for few-shot evaluation of language models.β9,906Updated this week
- PyTorch native post-training libraryβ5,426Updated this week
- Large Language Model Text Generation Inferenceβ10,442Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,630Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.β12,663Updated last week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.β15,682Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.β17,106Updated this week
- Official inference library for Mistral modelsβ10,429Updated 5 months ago
- Universal LLM Deployment Engine with ML Compilationβ21,172Updated this week
- Tools for merging pretrained large language models.β6,210Updated last week
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% lessβ¦β44,634Updated this week
- Composable building blocks to build Llama Appsβ7,985Updated this week
- Fast and memory-efficient exact attentionβ19,099Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.β8,710Updated last year
- Python bindings for llama.cppβ9,515Updated last week
- Robust recipes to align language models with human and AI preferencesβ5,329Updated 3 weeks ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β56,573Updated last week
- Inference code for CodeLlama modelsβ16,351Updated last year
- Minimal reproduction of DeepSeek R1-Zeroβ12,138Updated 4 months ago
- Awesome-LLM: a curated list of Large Language Modelβ24,826Updated 3 weeks ago
- Modeling, training, eval, and inference code for OLMoβ5,911Updated 2 weeks ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.β43,912Updated this week
- Accessible large language models via k-bit quantization for PyTorch.β7,490Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β23,378Updated last year