dshah3 / GPU-PuzzlesLinks
Solve puzzles. Learn CUDA.
☆64Updated last year
Alternatives and similar repositories for GPU-Puzzles
Users that are interested in GPU-Puzzles are comparing it to the libraries listed below
Sorting:
- ☆78Updated 10 months ago
- ☆156Updated last year
- ☆88Updated last year
- supporting pytorch FSDP for optimizers☆79Updated 5 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆183Updated last year
- A puzzle to learn about prompting☆127Updated 2 years ago
- Experiment of using Tangent to autodiff triton☆78Updated last year
- seqax = sequence modeling + JAX☆155Updated last month
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆133Updated last year
- Puzzles for exploring transformers☆347Updated 2 years ago
- ☆431Updated 7 months ago
- ☆262Updated 10 months ago
- A really tiny autograd engine☆95Updated this week
- Custom triton kernels for training Karpathy's nanoGPT.☆19Updated 7 months ago
- ☆53Updated last year
- ☆210Updated this week
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆67Updated 2 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆182Updated last week
- An interactive exploration of Transformer programming.☆264Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- train with kittens!☆57Updated 7 months ago
- ring-attention experiments☆143Updated 7 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated last month
- JAX implementation of the Llama 2 model☆217Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆44Updated this week
- Mixed precision training from scratch with Tensors and CUDA☆23Updated last year
- ML/DL Math and Method notes☆61Updated last year
- Load compute kernels from the Hub☆134Updated this week
- Custom kernels in Triton language for accelerating LLMs☆19Updated last year
- ☆112Updated last week