meta-llama / llama-models
Utilities intended for use with Llama models.
☆7,400 Updated 2 weeks ago
Alternatives and similar repositories for llama-models
Users who are interested in llama-models compare it to the libraries listed below.
- Agentic components of the Llama Stack APIs ☆4,280 Updated 4 months ago
- Composable building blocks to build LLM Apps ☆8,210 Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆18,121 Updated last month
- The official Meta Llama 3 GitHub site ☆29,148 Updated 11 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆25,863 Updated 2 months ago
- Set of tools to assess and improve LLM security. ☆3,947 Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,623 Updated 3 months ago
- Ollama Python library ☆9,069 Updated 2 weeks ago
- PyTorch native post-training library ☆5,629 Updated this week
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi… ☆3,637 Updated last week
- SGLang is a fast serving framework for large language models and vision language models. ☆21,945 Updated this week
- Tools for merging pretrained large language models. ☆6,630 Updated last week
- ☆3,467 Updated 9 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆2,313 Updated 11 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs ☆17,792 Updated this week
- Train transformer language models with reinforcement learning. ☆16,809 Updated this week
- Gemma open-weight LLM library, from Google DeepMind ☆3,908 Updated last month
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆12,481 Updated this week
- Fast and memory-efficient exact attention ☆21,317 Updated this week
- Go ahead and axolotl questions ☆11,005 Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,852 Updated last year
- Accessible large language models via k-bit quantization for PyTorch. ☆7,855 Updated 2 weeks ago
- Sky-T1: Train your own O1 preview model within $450 ☆3,363 Updated 5 months ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆17,425 Updated last month
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou… ☆3,706 Updated last month
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,987 Updated last year
- llama3 implementation one matrix multiplication at a time ☆15,203 Updated last year
- Fully open reproduction of DeepSeek-R1 ☆25,772 Updated last month
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,750 Updated 5 months ago
- ☆3,054 Updated last month