obadakhalili / tinygrad-tensor-puzzles
Solve puzzles to improve your tinygrad skills!
☆70Updated last month
Related projects: ⓘ
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆91Updated 2 months ago
- Tutorials on tinygrad☆157Updated 2 weeks ago
- Simple Transformer in Jax☆100Updated 2 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆165Updated last month
- could we make an ml stack in 100,000 lines of code?☆22Updated 2 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆110Updated last month
- A really tiny autograd engine☆85Updated 5 months ago
- The Tensor (or Array)☆388Updated last month
- Tensor library with autograd using only Rust's standard library☆61Updated 2 months ago
- parallelized hyperdimensional tictactoe☆105Updated 3 weeks ago
- High Quality Resources on GPU Programming/Architecture☆561Updated last month
- ☆97Updated 5 months ago
- Solve Puzzles. Learn Metal 🤘☆87Updated this week
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆45Updated 3 months ago
- GPT-2 (124M) quality in 5B tokens☆227Updated last week
- Alex Krizhevsky's original code from Google Code☆185Updated 8 years ago
- ☆52Updated last week
- Solve puzzles. Learn CUDA.☆53Updated 9 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆152Updated 11 months ago
- ☆27Updated 2 months ago
- Gradient descent is cool and all, but what if we could delete it?☆99Updated 2 months ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆115Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆246Updated 3 months ago
- Fast bare-bones BPE for modern tokenizer training☆138Updated 3 weeks ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆88Updated 11 months ago
- a tiny vectorstore implementation built with numpy.☆50Updated 4 months ago
- An ML Systems Onboarding list☆491Updated last month
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆59Updated 2 months ago
- UNet diffusion model in pure CUDA☆562Updated 2 months ago
- The history files when recording human interaction while solving ARC tasks☆91Updated this week