meta-llama / llama-modelsLinks

Utilities intended for use with Llama models.

☆7,096

Alternatives and similar repositories for llama-models

Users that are interested in llama-models are comparing it to the libraries listed below

Sorting:

meta-llama / llama-stack-apps
Agentic components of the Llama Stack APIs
☆4,264Updated last month
meta-llama / llama-stack
Composable building blocks to build Llama Apps
☆7,864Updated this week
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆17,516Updated last week
QwenLM / Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆22,175Updated 2 weeks ago
ollama / ollama-python
Ollama Python library
☆7,907Updated last week
meta-llama / llama3
The official Meta Llama 3 GitHub site
☆28,797Updated 5 months ago
meta-llama / PurpleLlama
Set of tools to assess and improve LLM security.
☆3,505Updated last week
QwenLM / Qwen-Agent
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆9,709Updated last week
huggingface / open-r1
Fully open reproduction of DeepSeek-R1
☆24,859Updated this week
openai / simple-evals
☆3,740Updated last month
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆15,421Updated this week
google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
☆5,484Updated 3 weeks ago
pytorch / torchtune
PyTorch native post-training library
☆5,287Updated this week
NovaSky-AI / SkyThought
Sky-T1: Train your own O1 preview model within $450
☆3,272Updated last month
simplescaling / s1
s1: Simple test-time scaling
☆6,455Updated last month
QwenLM / Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
☆5,029Updated last week
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆5,702Updated last week
apple / corenet
CoreNet: A library for training deep neural networks
☆7,016Updated last month
Jiayi-Pan / TinyZero
Minimal reproduction of DeepSeek R1-Zero
☆11,926Updated 2 months ago
QwenLM / Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆11,207Updated last month
meta-llama / codellama
Inference code for CodeLlama models
☆16,339Updated 10 months ago
google-deepmind / gemma
Gemma open-weight LLM library, from Google DeepMind
☆3,434Updated this week
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
☆41,118Updated this week
pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,596Updated last month
anthropics / anthropic-quickstarts
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
☆9,169Updated 3 weeks ago
kyutai-labs / moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆8,484Updated this week
mistralai / mistral-inference
Official inference library for Mistral models
☆10,307Updated 3 months ago
agentica-project / rllm
Democratizing Reinforcement Learning for LLMs
☆3,396Updated last month
MoonshotAI / Kimi-k1.5
☆3,374Updated 3 months ago
volcengine / verl
verl: Volcano Engine Reinforcement Learning for LLMs
☆9,958Updated this week