meta-llama / llama-models
Utilities intended for use with Llama models.
☆7,048 Updated this week
Alternatives and similar repositories for llama-models
Users who are interested in llama-models are comparing it to the libraries listed below.
- Agentic components of the Llama Stack APIs ☆4,248 Updated last month
- Composable building blocks to build Llama Apps ☆7,823 Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆14,814 Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆17,431 Updated this week
- The official Meta Llama 3 GitHub site ☆28,755 Updated 4 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆19,890 Updated 2 months ago
- Official inference library for Mistral models ☆10,275 Updated 2 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆21,778 Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,204 Updated last week
- s1: Simple test-time scaling ☆6,425 Updated 2 weeks ago
- Ollama Python library ☆7,738 Updated last week
- ☆3,355 Updated 3 months ago
- PyTorch native post-training library ☆5,233 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆48,865 Updated this week
- Set of tools to assess and improve LLM security. ☆3,436 Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆8,356 Updated last week
- Train transformer language models with reinforcement learning. ☆14,046 Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs ☆8,850 Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆10,773 Updated 3 weeks ago
- Inference code for Llama models ☆58,316 Updated 4 months ago
- The official Python SDK for Model Context Protocol servers and clients ☆13,591 Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆9,200 Updated last week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati… ☆10,629 Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆18,432 Updated last month
- Fully open reproduction of DeepSeek-R1 ☆24,692 Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,588 Updated 2 weeks ago
- Gemma open-weight LLM library, from Google DeepMind ☆3,372 Updated this week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization ☆4,521 Updated this week
- The official PyTorch implementation of Google's Gemma models ☆5,472 Updated last week
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi… ☆3,340 Updated this week