Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
☆4,755Jul 18, 2025Updated 8 months ago
Alternatives and similar repositories for lingua
Users that are interested in lingua are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PyTorch native platform for training generative AI models☆5,220Updated this week
- Official inference framework for 1-bit LLMs☆38,049Mar 10, 2026Updated last month
- Minimalistic large language model 3D-parallelism training☆2,644Apr 7, 2026Updated last week
- Efficient Triton Kernels for LLM Training☆6,265Updated this week
- PyTorch native post-training library☆5,728Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- NanoGPT (124M) in 2 minutes☆5,070Mar 29, 2026Updated 2 weeks ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,194Aug 22, 2025Updated 7 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,983Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,146Aug 26, 2025Updated 7 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,094Jul 29, 2024Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆3,431Nov 13, 2024Updated last year
- Train transformer language models with reinforcement learning.☆17,967Apr 7, 2026Updated last week
- AllenAI's post-training codebase☆3,683Updated this week
- 🚀 Efficient implementations for emerging model architectures☆4,878Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fully open reproduction of DeepSeek-R1☆25,973Apr 2, 2026Updated last week
- A framework for few-shot evaluation of language models.☆12,138Updated this week
- Tools for merging pretrained large language models.☆6,973Mar 15, 2026Updated 3 weeks ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,603Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆25,643Updated this week
- Schedule-Free Optimization in PyTorch☆2,271May 21, 2025Updated 10 months ago
- Fast and memory-efficient exact attention☆23,344Updated this week
- Modeling, training, eval, and inference code for OLMo☆6,463Nov 24, 2025Updated 4 months ago
- Code for BLT research paper☆2,032Nov 3, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,297Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆13,038Feb 27, 2026Updated last month
- Robust recipes to align language models with human and AI preferences☆5,558Updated this week
- Muon is Scalable for LLM Training☆1,453Aug 3, 2025Updated 8 months ago
- A bibliography and survey of the papers surrounding o1☆1,213Nov 16, 2024Updated last year
- Helpful tools and examples for working with flex-attention☆1,174Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆76,536Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,278Apr 1, 2026Updated last week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆21,284Mar 11, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,374Apr 7, 2026Updated last week
- Tile primitives for speedy kernels☆3,312Apr 8, 2026Updated last week
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…☆9,340Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.☆61,312Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,010Mar 4, 2026Updated last month
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,905Dec 17, 2025Updated 3 months ago
- Ongoing research training transformer models at scale☆15,985Updated this week