meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
☆16,206Updated this week
Alternatives and similar repositories for llama-cookbook:
Users that are interested in llama-cookbook are comparing it to the libraries listed below
- A high-throughput and memory-efficient inference and serving engine for LLMs☆38,093Updated this week
- Large Language Model Text Generation Inference☆9,756Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆9,679Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆29,780Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,242Updated 8 months ago
- The official Meta Llama 3 GitHub site☆28,309Updated 3 weeks ago
- Inference code for Llama models☆57,622Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,568Updated 2 weeks ago
- Go ahead and axolotl questions☆8,620Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆17,319Updated this week
- DSPy: The framework for programming—not prompting—language models☆21,882Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,500Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,201Updated 9 months ago
- PyTorch native post-training library☆4,834Updated this week
- Train transformer language models with reinforcement learning.☆11,688Updated this week
- Inference Llama 2 in one file of pure C☆18,027Updated 6 months ago
- Fast and memory-efficient exact attention☆15,503Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆13,406Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆37,796Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…☆9,422Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,444Updated 6 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,814Updated 2 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆40,550Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆11,290Updated 2 months ago
- Inference code for CodeLlama models☆16,205Updated 6 months ago
- Instruct-tune LLaMA on consumer hardware☆18,803Updated 6 months ago
- Universal LLM Deployment Engine with ML Compilation☆19,972Updated this week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,792Updated 7 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,440Updated last year
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆38,980Updated this week