What would you do with 1000 H100s...
☆1,154 · Jan 10, 2024 · Updated 2 years ago
Alternatives and similar repositories for LLM-Training-Puzzles
Users who are interested in LLM-Training-Puzzles are comparing it to the libraries listed below
- Puzzles for exploring transformers · ☆386 · May 4, 2023 · Updated 2 years ago
- ☆497 · Oct 18, 2024 · Updated last year
- A puzzle to learn about prompting · ☆135 · May 12, 2023 · Updated 2 years ago
- Solve puzzles. Improve your pytorch. · ☆3,950 · Jul 15, 2024 · Updated last year
- Puzzles for learning Triton · ☆2,314 · Nov 18, 2024 · Updated last year
- Solve puzzles. Learn CUDA. · ☆11,970 · Sep 1, 2024 · Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose · ☆2,090 · Aug 26, 2025 · Updated 6 months ago
- Annotated version of the Mamba paper · ☆497 · Feb 27, 2024 · Updated 2 years ago
- Minimalistic large language model 3D-parallelism training · ☆2,579 · Feb 19, 2026 · Updated last week
- A PyTorch native platform for training generative AI models · ☆5,098 · Updated this week
- GPU programming related news and material links · ☆1,997 · Sep 17, 2025 · Updated 5 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. · ☆831 · Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. · ☆6,184 · Aug 22, 2025 · Updated 6 months ago
- Tile primitives for speedy kernels · ☆3,183 · Updated this week
- Machine Learning Engineering Open Book · ☆17,162 · Feb 21, 2026 · Updated last week
- 🚀 Efficient implementations of state-of-the-art linear attention models · ☆4,428 · Updated this week
- A bibliography and survey of the papers surrounding o1 · ☆1,212 · Nov 16, 2024 · Updated last year
- A simple, performant and scalable Jax LLM! · ☆2,148 · Feb 24, 2026 · Updated last week
- Efficient Triton Kernels for LLM Training · ☆6,162 · Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax · ☆695 · Jan 26, 2026 · Updated last month
- Implementation of https://srush.github.io/annotated-s4 · ☆512 · Jun 20, 2025 · Updated 8 months ago
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. · ☆4,752 · Jul 18, 2025 · Updated 7 months ago
- ☆93 · Jul 5, 2024 · Updated last year
- Accessible large language models via k-bit quantization for PyTorch. · ☆7,997 · Updated this week
- Experiment of using Tangent to autodiff triton · ☆82 · Jan 22, 2024 · Updated 2 years ago
- Development repository for the Triton language and compiler · ☆18,501 · Updated this week
- ☆4,110 · Jun 4, 2024 · Updated last year
- Ring attention implementation with flash attention · ☆986 · Sep 10, 2025 · Updated 5 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models · ☆1,017 · Aug 21, 2024 · Updated last year
- Fast and memory-efficient exact attention · ☆22,361 · Updated this week
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… · ☆1,585 · Jan 28, 2026 · Updated last month
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. · ☆2,903 · Feb 24, 2026 · Updated last week
- Robust recipes to align language models with human and AI preferences · ☆5,506 · Sep 8, 2025 · Updated 5 months ago
- ☆316 · Jun 21, 2024 · Updated last year
- ☆22 · Apr 22, 2024 · Updated last year
- PyTorch native quantization and sparsity for training and inference · ☆2,707 · Updated this week
- Material for gpu-mode lectures · ☆5,773 · Feb 1, 2026 · Updated last month
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) · ☆4,741 · Jan 8, 2024 · Updated 2 years ago
- A tiny library for coding with large language models. · ☆1,233 · Jul 10, 2024 · Updated last year