huggingface/nanotron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huggingface/nanotron)

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

☆2,761

Alternatives and similar repositories for nanotron

Users that are interested in nanotron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huggingface / picotron
View on GitHub
Minimalistic 4D-parallelism distributed training framework for education purpose
☆2,255Aug 26, 2025Updated 10 months ago
huggingface / datatrove
View on GitHub
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆3,221Updated this week
huggingface / lighteval
View on GitHub
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆2,494Jun 29, 2026Updated 3 weeks ago
pytorch / torchtitan
View on GitHub
A PyTorch native platform for training generative AI models
☆5,554Updated this week
linkedin / Liger-Kernel
View on GitHub
Efficient Triton Kernels for LLM Training
☆6,532Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
NVIDIA / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆17,181Updated this week
NVIDIA / TransformerEngine
View on GitHub
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆3,441Updated this week
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,913Updated this week
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,626Updated this week
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,643May 26, 2026Updated last month
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,258Jun 17, 2026Updated last month
EleutherAI / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of language models.
☆13,390Jul 13, 2026Updated last week
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,519Updated this week
meta-pytorch / torchtune
View on GitHub
PyTorch native post-training library
☆5,785Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
databricks / megablocks
View on GitHub
☆1,582Mar 25, 2026Updated 3 months ago
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,607Updated this week
sgl-project / mini-sglang
View on GitHub
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
☆4,628May 17, 2026Updated 2 months ago
huggingface / smollm
View on GitHub
Everything about the SmolLM and SmolVLM family of models
☆3,851May 26, 2026Updated last month
allenai / open-instruct
View on GitHub
AllenAI's post-training codebase
☆3,807Updated this week
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆14,619Apr 26, 2026Updated 2 months ago
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,841Jul 14, 2026Updated last week
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations for emerging model architectures
☆5,401Updated this week
facebookresearch / lingua
View on GitHub
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
☆4,757Jul 18, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,669Updated this week
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆6,014Updated this week
pytorch / ao
View on GitHub
PyTorch native quantization and sparsity for training and inference
☆2,910Updated this week
huggingface / picotron_tutorial
View on GitHub
☆253Nov 24, 2025Updated 8 months ago
bitsandbytes-foundation / bitsandbytes
View on GitHub
Accessible large language models via k-bit quantization for PyTorch.
☆8,337Updated this week
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,342Updated this week
meta-pytorch / gpt-fast
View on GitHub
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,229Aug 22, 2025Updated 11 months ago
huggingface / kernels
View on GitHub
Build compute kernels and load them from the Hub.
☆715Updated this week
HazyResearch / ThunderKittens
View on GitHub
Tile primitives for speedy kernels
☆3,561Jul 13, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KellerJordan / modded-nanogpt
View on GitHub
NanoGPT (124M) in 90 seconds
☆5,577Jul 3, 2026Updated 3 weeks ago
zhuzilin / ring-flash-attention
View on GitHub
Ring attention implementation with flash attention
☆1,037Sep 10, 2025Updated 10 months ago
radixark / miles
View on GitHub
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,779Updated this week
allenai / OLMo
View on GitHub
Modeling, training, eval, and inference code for OLMo
☆6,602Nov 24, 2025Updated 8 months ago
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,238Updated this week
huggingface / text-generation-inference
View on GitHub
Large Language Model Text Generation Inference
☆10,882Mar 21, 2026Updated 4 months ago
huggingface / cosmopedia
View on GitHub
☆572Nov 20, 2024Updated last year