arpitingle / gpu-alpha
High Quality Resources on GPU Programming/Architecture
☆566 Updated 3 months ago
Related projects
Alternatives and complementary repositories for gpu-alpha
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆174 Updated this week
- From the Tensor to Stable Diffusion, a rough outline for a 9 week course. ☆1,030 Updated 6 months ago
- An ML Systems Onboarding list ☆545 Updated this week
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆167 Updated 3 months ago
- learningggggggg 🐳 ☆121 Updated last week
- a tiny multidimensional array implementation in C similar to numpy, but only one file. ☆219 Updated 3 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa… ☆170 Updated last month
- Solve puzzles to improve your tinygrad skills! ☆87 Updated 2 months ago
- The Tensor (or Array) ☆411 Updated 3 months ago
- UNet diffusion model in pure CUDA ☆584 Updated 4 months ago
- Tutorials on tinygrad ☆180 Updated last week
- Solve Puzzles. Learn Metal 🤘 ☆326 Updated last month
- A really tiny autograd engine ☆87 Updated 7 months ago
- Intro to leetcodes. Basic techniques, quicksort and hash structures implementation, space and time complexities. ☆95 Updated 3 months ago
- The Multilayer Perceptron Language Model ☆523 Updated 3 months ago
- The Autograd Engine ☆534 Updated 2 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1. ☆116 Updated 3 months ago
- From the Transistor to the Web Browser, a rough outline for a 12 week course ☆127 Updated 6 months ago
- ☆47 Updated 3 months ago
- Alex Krizhevsky's original code from Google Code ☆190 Updated 8 years ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆715 Updated last month
- could we make an ml stack in 100,000 lines of code? ☆26 Updated 4 months ago
- Because tinygrad got out of hand with line count ☆145 Updated last month
- Simple Transformer in Jax ☆119 Updated 4 months ago
- Deep Learning resources ☆122 Updated last year
- If tinygrad wasn't small enough for you... ☆654 Updated 8 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆107 Updated last year
- ☆99 Updated 7 months ago
- LLM papers I'm reading, mostly on inference and model compression ☆694 Updated 10 months ago
- parallelized hyperdimensional tictactoe ☆110 Updated 2 months ago