Lightning-AI / lightning-thunderLinks

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

☆1,424

Alternatives and similar repositories for lightning-thunder

Users that are interested in lightning-thunder are comparing it to the libraries listed below

Sorting:

pytorch / ao
PyTorch native quantization and sparsity for training and inference
☆2,543Updated this week
Lightning-AI / litData
Speed up model training by fixing data loading.
☆561Updated last week
facebookresearch / schedule_free
Schedule-Free Optimization in PyTorch
☆2,237Updated 6 months ago
HazyResearch / ThunderKittens
Tile primitives for speedy kernels
☆2,955Updated this week
pytorch / torchtitan
A PyTorch native platform for training generative AI models
☆4,778Updated this week
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,351Updated last week
pytorch / tensordict
TensorDict is a pytorch dedicated tensor container.
☆988Updated last week
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆655Updated last year
huggingface / optimum-nvidia
☆1,011Updated 9 months ago
facebookresearch / optimizers
For optimization algorithm research and development.
☆547Updated 2 weeks ago
AI-Hypercomputer / maxtext
A simple, performant and scalable Jax LLM!
☆2,006Updated this week
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,901Updated 3 months ago
huggingface / optimum-quanto
A pytorch quantization backend for optimum
☆1,011Updated last week
BobMcDear / attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
☆584Updated 3 months ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆932Updated 2 weeks ago
srush / Triton-Puzzles
Puzzles for learning Triton
☆2,143Updated last year
dropbox / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆894Updated last month
google-ai-edge / model-explorer
A modern model graph visualizer and debugger
☆1,342Updated this week
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆829Updated 4 months ago
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,534Updated last year
meta-pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆454Updated 3 weeks ago
google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,828Updated 5 months ago
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆543Updated last month
facebookresearch / spdl
Scalable and Performant Data Loading
☆345Updated last week
meta-pytorch / attention-gym
Helpful tools and examples for working with flex-attention
☆1,062Updated 2 weeks ago
gpu-mode / resource-stream
GPU programming related news and material links
☆1,803Updated 2 months ago
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,067Updated last year
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆3,911Updated last week
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆2,954Updated last week
mlops-discord / gpu-optimization-workshop
Slides, notes, and materials for the workshop
☆335Updated last year