srush / GPU-Puzzles
Solve puzzles. Learn CUDA.
☆10,878Updated 7 months ago
Alternatives and similar repositories for GPU-Puzzles:
Users that are interested in GPU-Puzzles are comparing it to the libraries listed below
- Solve puzzles. Improve your pytorch.☆3,526Updated 9 months ago
- Material for gpu-mode lectures☆4,245Updated 2 months ago
- Puzzles for learning Triton☆1,577Updated 5 months ago
- GPU programming related news and material links☆1,454Updated 3 months ago
- Fast and memory-efficient exact attention☆16,929Updated last week
- A minimal GPU design in Verilog to learn how GPUs work from the ground up☆8,211Updated 8 months ago
- Machine Learning Engineering Open Book☆13,438Updated 2 weeks ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,926Updated last week
- Development repository for the Triton language and compiler☆15,290Updated this week
- A Python framework for high performance GPU simulation and graphics☆4,957Updated this week
- DSPy: The framework for programming—not prompting—language models☆23,550Updated this week
- Tile primitives for speedy kernels☆2,259Updated this week
- CUDA Templates for Linear Algebra Subroutines☆7,294Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,981Updated this week
- NanoGPT (124M) in 3 minutes☆2,493Updated 2 weeks ago
- Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.☆8,210Updated this week
- Understanding Deep Learning - Simon J.D. Prince☆7,387Updated last week
- Train transformer language models with reinforcement learning.☆13,280Updated this week
- Schedule-Free Optimization in PyTorch☆2,142Updated last week
- Explanation to key concepts in ML☆7,538Updated this week
- The full minitorch student suite.☆2,050Updated 8 months ago
- PyTorch native post-training library☆5,103Updated this week
- Kolmogorov Arnold Networks☆15,595Updated 3 months ago
- Minimalist ML framework for Rust☆17,045Updated this week
- A PyTorch native library for large-scale model training☆3,607Updated this week
- A self-paced course to learn Rust, one exercise at a time.☆7,603Updated last month
- What would you do with 1000 H100s...☆1,035Updated last year
- PyTorch native quantization and sparsity for training and inference☆1,974Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆11,651Updated 8 months ago
- Go ahead and axolotl questions☆9,137Updated this week