srush / Tensor-Puzzles
Solve puzzles. Improve your pytorch.
☆3,282Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for Tensor-Puzzles
- Solve puzzles. Learn CUDA.☆9,933Updated 2 months ago
- Puzzles for learning Triton☆1,135Updated this week
- What would you do with 1000 H100s...☆903Updated 10 months ago
- The full minitorch student suite.☆1,917Updated 3 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,679Updated this week
- A native PyTorch Library for large model training☆2,623Updated this week
- Language model alignment-focused deep learning curriculum☆1,269Updated 3 months ago
- An autoregressive character-level language model for making more things☆2,607Updated 5 months ago
- 🧠 A study guide to learn about Transformers☆1,541Updated last year
- ☆391Updated last month
- GPU programming related news and material links☆1,237Updated last month
- Schedule-Free Optimization in PyTorch☆1,898Updated 2 weeks ago
- Tile primitives for speedy kernels☆1,658Updated this week
- Video+code lecture on building nanoGPT from scratch☆3,611Updated 3 months ago
- NanoGPT (124M) quality in 7.8 8xH100-minutes☆1,033Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆715Updated last month
- Machine Learning Engineering Open Book☆11,655Updated last week
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,223Updated last year
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors …☆1,178Updated last year
- UNet diffusion model in pure CUDA☆584Updated 4 months ago
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆10,520Updated 3 months ago
- Puzzles for exploring transformers☆325Updated last year
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,199Updated this week
- Collection of important articles to be treated as a textbook☆612Updated 7 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,669Updated last month
- High Quality Resources on GPU Programming/Architecture☆566Updated 3 months ago
- Material for gpu-mode lectures☆3,028Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,195Updated 4 months ago
- Notebooks and various random fun☆1,079Updated last year
- Tensors, for human consumption☆1,113Updated 3 weeks ago