meta-llama / llama-models
Utilities intended for use with Llama models.
☆4,852Updated this week
Related projects ⓘ
Alternatives and complementary repositories for llama-models
- Agentic components of the Llama Stack APIs☆3,894Updated this week
- Composable building blocks to build Llama Apps☆4,594Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,383Updated this week
- Set of tools to assess and improve LLM security.☆2,721Updated this week
- PyTorch native finetuning library☆4,336Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,227Updated last week
- Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom dataset…☆15,222Updated this week
- Ollama Python library☆4,633Updated this week
- ☆4,035Updated 5 months ago
- DataComp for Language Models☆1,157Updated this week
- ☆8,482Updated last month
- Parse files for optimal RAG☆3,173Updated last week
- The official Meta Llama 3 GitHub site☆27,145Updated 3 months ago
- CoreNet: A library for training deep neural networks☆6,983Updated last month
- Modeling, training, eval, and inference code for OLMo☆4,645Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆9,783Updated this week
- ☆2,746Updated 2 months ago
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.☆6,899Updated last week
- The official PyTorch implementation of Google's Gemma models☆5,290Updated 3 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,195Updated 4 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!☆3,256Updated 3 months ago
- ☆2,898Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆4,768Updated 2 weeks ago
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.☆2,710Updated this week
- Tools for merging pretrained large language models.☆4,816Updated 2 weeks ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆2,673Updated 3 months ago
- Efficient Triton Kernels for LLM Training☆3,454Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning.☆2,010Updated 3 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,571Updated last week