What would you do with 1000 H100s...
☆1,173 · Jan 10, 2024 · Updated 2 years ago
Alternatives and similar repositories for LLM-Training-Puzzles
Users that are interested in LLM-Training-Puzzles are comparing it to the libraries listed below.
- Puzzles for exploring transformers ☆392 · May 4, 2023 · Updated 3 years ago
- ☆504 · Oct 18, 2024 · Updated last year
- A puzzle to learn about prompting ☆138 · May 12, 2023 · Updated 3 years ago
- Solve puzzles. Improve your pytorch. ☆4,065 · Jul 15, 2024 · Updated last year
- Puzzles for learning Triton ☆2,457 · Apr 1, 2026 · Updated last month
- Solve puzzles. Learn CUDA. ☆12,179 · Sep 1, 2024 · Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆2,188 · Aug 26, 2025 · Updated 9 months ago
- Annotated version of the Mamba paper ☆501 · Feb 27, 2024 · Updated 2 years ago
- GPU programming related news and material links ☆2,142 · Mar 8, 2026 · Updated 2 months ago
- A PyTorch native platform for training generative AI models ☆5,362 · May 21, 2026 · Updated last week
- Minimalistic large language model 3D-parallelism training ☆2,698 · Apr 7, 2026 · Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,212 · Aug 22, 2025 · Updated 9 months ago
- Machine Learning Engineering Open Book ☆18,006 · May 18, 2026 · Updated last week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆839 · Mar 15, 2026 · Updated 2 months ago
- Tile primitives for speedy kernels ☆3,377 · May 22, 2026 · Updated last week
- A bibliography and survey of the papers surrounding o1 ☆1,214 · Nov 16, 2024 · Updated last year
- 🚀 Efficient implementations for emerging model architectures ☆5,139 · Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆708 · Jan 26, 2026 · Updated 4 months ago
- Efficient Triton Kernels for LLM Training ☆6,365 · May 18, 2026 · Updated last week
- Implementation of https://srush.github.io/annotated-s4 ☆517 · Jun 20, 2025 · Updated 11 months ago
- Experiment of using Tangent to autodiff triton ☆82 · Jan 22, 2024 · Updated 2 years ago
- A simple, performant and scalable Jax LLM! ☆2,295 · Updated this week
- ☆22 · Apr 22, 2024 · Updated 2 years ago
- Development repository for the Triton language and compiler ☆19,246 · May 22, 2026 · Updated last week
- Ring attention implementation with flash attention ☆1,021 · Sep 10, 2025 · Updated 8 months ago
- Accessible large language models via k-bit quantization for PyTorch. ☆8,216 · May 20, 2026 · Updated last week
- ☆93 · Jul 5, 2024 · Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,762 · Jul 18, 2025 · Updated 10 months ago
- ☆4,113 · Apr 15, 2026 · Updated last month
- Fast and memory-efficient exact attention ☆23,917 · Updated this week
- Material for gpu-mode lectures ☆6,098 · May 9, 2026 · Updated 2 weeks ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,585 · Jan 28, 2026 · Updated 4 months ago
- A tiny library for coding with large language models. ☆1,234 · Jul 10, 2024 · Updated last year
- ☆581 · Jul 11, 2024 · Updated last year
- FlashInfer: Kernel Library for LLM Serving ☆5,666 · Updated this week
- ☆329 · Updated this week
- Robust recipes to align language models with human and AI preferences ☆5,605 · Apr 8, 2026 · Updated last month
- Implementation of a Transformer, but completely in Triton ☆278 · Apr 5, 2022 · Updated 4 years ago
- NanoGPT (124M) in 90 seconds ☆5,270 · May 14, 2026 · Updated 2 weeks ago