meta-llama / llama-models
Utilities intended for use with Llama models.
☆6,831Updated last week
Alternatives and similar repositories for llama-models:
Users that are interested in llama-models are comparing it to the libraries listed below
- Agentic components of the Llama Stack APIs☆4,209Updated 2 weeks ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,095Updated this week
- Large Concept Models: Language modeling in a sentence representation space☆2,098Updated 2 months ago
- Fully open reproduction of DeepSeek-R1☆24,020Updated this week
- Composable building blocks to build Llama Apps☆7,714Updated this week
- s1: Simple test-time scaling☆6,217Updated 2 weeks ago
- The official Meta Llama 3 GitHub site☆28,632Updated 2 months ago
- Set of tools to assess and improve LLM security.☆3,042Updated 2 months ago
- PyTorch native post-training library☆5,103Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆9,844Updated last week
- verl: Volcano Engine Reinforcement Learning for LLMs☆6,909Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆13,368Updated this week
- ☆3,284Updated last month
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆16,726Updated last month
- NanoGPT (124M) in 3 minutes☆2,493Updated 3 weeks ago
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,140Updated 2 months ago
- Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆37,364Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,514Updated 2 weeks ago
- Ongoing research training transformer models at scale☆12,118Updated this week
- Go ahead and axolotl questions☆9,137Updated this week
- Large Language Model Text Generation Inference☆10,031Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,004Updated this week
- Fast and memory-efficient exact attention☆16,929Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆45,116Updated this week
- Sky-T1: Train your own O1 preview model within $450☆3,209Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,564Updated last week
- Everything you need to build state-of-the-art foundation models, end-to-end.☆7,881Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,567Updated this week
- nanoGPT style version of Llama 3.1☆1,356Updated 8 months ago
- AllenAI's post-training codebase☆2,913Updated this week