srush / GPU-Puzzles
Solve puzzles. Learn CUDA.
☆11,932 Updated last year
Alternatives and similar repositories for GPU-Puzzles
Users who are interested in GPU-Puzzles are comparing it to the libraries listed below.
- Solve puzzles. Improve your pytorch. ☆3,912 Updated last year
- Material for gpu-mode lectures ☆5,679 Updated last week
- NanoGPT (124M) in 2 minutes ☆4,589 Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,293 Updated last year
- GPU programming related news and material links ☆1,955 Updated 4 months ago
- The full minitorch student suite. ☆2,292 Updated last year
- Puzzles for learning Triton ☆2,283 Updated last year
- A PyTorch native platform for training generative AI models ☆5,045 Updated this week
- A minimal GPU design in Verilog to learn how GPUs work from the ground up ☆11,290 Updated last year
- Tile primitives for speedy kernels ☆3,120 Updated this week
- Development repository for the Triton language and compiler ☆18,319 Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,702 Updated 3 weeks ago
- llama3 implementation one matrix multiplication at a time ☆15,241 Updated last year
- Video+code lecture on building nanoGPT from scratch ☆4,719 Updated last year
- Neural Networks: Zero to Hero ☆20,161 Updated last year
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API ☆14,546 Updated last year
- LLM training in simple, raw C/CUDA ☆28,763 Updated 7 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,183 Updated 5 months ago
- An autoregressive character-level language model for making more things ☆3,635 Updated last year
- A Python framework for accelerated simulation, data generation and spatial computing. ☆6,191 Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆2,058 Updated 5 months ago
- ☆4,112 Updated last year
- Machine Learning Engineering Open Book ☆16,586 Updated 2 weeks ago
- Understanding Deep Learning - Simon J.D. Prince ☆9,051 Updated 2 weeks ago
- What would you do with 1000 H100s... ☆1,151 Updated 2 years ago
- ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Libr… ☆3,325 Updated last year
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,860 Updated 7 months ago
- Fast and memory-efficient exact attention ☆22,113 Updated this week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,387 Updated 2 weeks ago
- Schedule-Free Optimization in PyTorch ☆2,254 Updated 8 months ago