srush / GPU-Puzzles
Solve puzzles. Learn CUDA.
☆11,616 Updated last year
Alternatives and similar repositories for GPU-Puzzles
Users who are interested in GPU-Puzzles are comparing it to the libraries listed below.
- Solve puzzles. Improve your pytorch. ☆3,771 Updated last year
- Material for gpu-mode lectures ☆5,257 Updated last month
- Puzzles for learning Triton ☆2,105 Updated 11 months ago
- The full minitorch student suite. ☆2,214 Updated last year
- GPU programming related news and material links ☆1,764 Updated last month
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API ☆13,646 Updated last year
- Tile primitives for speedy kernels ☆2,873 Updated this week
- Machine Learning Engineering Open Book ☆15,674 Updated 2 weeks ago
- This project is a stock trend prediction web application created using Python and Streamlit. The purpose of this web application is to al… ☆10 Updated 2 years ago
- Development repository for the Triton language and compiler ☆17,529 Updated this week
- A PyTorch native platform for training generative AI models ☆4,675 Updated this week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra ☆8,742 Updated this week
- A Python framework for accelerated simulation, data generation and spatial computing. ☆5,754 Updated last week
- LLM training in simple, raw C/CUDA ☆28,081 Updated 4 months ago
- llama3 implementation one matrix multiplication at a time ☆15,195 Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,617 Updated 2 months ago
- NanoGPT (124M) in 3 minutes ☆3,785 Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,116 Updated last year
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,824 Updated 4 months ago
- Understanding Deep Learning - Simon J.D. Prince ☆8,484 Updated last week
- A lightweight library for portable low-level GPU computation using WebGPU. ☆3,915 Updated last month
- What would you do with 1000 H100s... ☆1,121 Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,146 Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆1,890 Updated 2 months ago
- An ML Systems Onboarding list ☆928 Updated 9 months ago
- You like pytorch? You like micrograd? You love tinygrad! ❤️ ☆30,546 Updated this week
- CUDA Python: Performance meets Productivity ☆3,025 Updated this week
- Inference Llama 2 in one file of pure C ☆18,912 Updated last year
- ☆2,007 Updated last week
- Neural Networks: Zero to Hero ☆18,487 Updated last year