meta-llama / llama-modelsLinks
Utilities intended for use with Llama models.
☆7,143Updated last month
Alternatives and similar repositories for llama-models
Users that are interested in llama-models are comparing it to the libraries listed below
Sorting:
- Agentic components of the Llama Stack APIs☆4,266Updated 2 months ago
- Composable building blocks to build Llama Apps☆7,907Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,625Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,600Updated last week
- PyTorch native post-training library☆5,347Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆16,097Updated this week
- Gemma open-weight LLM library, from Google DeepMind☆3,517Updated last week
- CoreNet: A library for training deep neural networks☆7,013Updated 2 months ago
- The official PyTorch implementation of Google's Gemma models☆5,504Updated last month
- A PyTorch native platform for training generative AI models☆4,056Updated this week
- ☆2,984Updated 10 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆15,104Updated 4 months ago
- Modeling, training, eval, and inference code for OLMo☆5,778Updated this week
- Set of tools to assess and improve LLM security.☆3,599Updated 2 weeks ago
- s1: Simple test-time scaling☆6,501Updated 3 weeks ago
- Fully open reproduction of DeepSeek-R1☆25,056Updated last week
- Ollama Python library☆8,010Updated last week
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆4,957Updated 4 months ago
- NanoGPT (124M) in 3 minutes☆2,811Updated this week
- ☆3,851Updated last week
- DataComp for Language Models☆1,324Updated 3 months ago
- Everything about the SmolLM and SmolVLM family of models☆2,909Updated last week
- verl: Volcano Engine Reinforcement Learning for LLMs☆11,168Updated this week
- The official Meta Llama 3 GitHub site☆28,838Updated 5 months ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆11,551Updated 2 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,246Updated 5 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆42,005Updated this week
- Fast and memory-efficient exact attention☆18,340Updated this week
- AllenAI's post-training codebase☆3,061Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,923Updated 9 months ago