facebookresearch / lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
☆4,227Updated last week
Related projects ⓘ
Alternatives and complementary repositories for lingua
- Official inference framework for 1-bit LLMs☆11,271Updated last week
- Composable building blocks to build Llama Apps☆4,594Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆2,489Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,383Updated this week
- Efficient Triton Kernels for LLM Training☆3,454Updated this week
- PyTorch native finetuning library☆4,336Updated this week
- Agentic components of the Llama Stack APIs☆3,894Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,840Updated 3 months ago
- A native PyTorch Library for large model training☆2,623Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆3,036Updated last week
- ☆2,746Updated 2 months ago
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.☆5,211Updated this week
- PyTorch native quantization and sparsity for training and inference☆1,585Updated this week
- nanoGPT style version of Llama 3.1☆1,246Updated 3 months ago
- NanoGPT (124M) quality in 7.8 8xH100-minutes☆1,033Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning.☆2,010Updated 3 weeks ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆3,540Updated 2 weeks ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆3,906Updated last month
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,155Updated 2 weeks ago
- Video+code lecture on building nanoGPT from scratch☆3,611Updated 3 months ago
- Schedule-Free Optimization in PyTorch☆1,898Updated 2 weeks ago
- ☆1,954Updated 3 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,045Updated this week
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API☆6,841Updated this week
- Utilities intended for use with Llama models.☆4,852Updated this week
- Blazingly fast LLM inference.☆4,472Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆2,602Updated last week
- 4M: Massively Multimodal Masked Modeling☆1,607Updated last month
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆890Updated 2 months ago
- DataComp for Language Models☆1,157Updated this week