adam-maj / tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
☆7,092 Updated 3 months ago
Related projects
Alternatives and complementary repositories for tiny-gpu
- Solve puzzles. Learn CUDA. ☆9,956 Updated 2 months ago
- A lightweight library for portable low-level GPU computation using WebGPU. ☆3,755 Updated this week
- Solve puzzles. Improve your pytorch. ☆3,286 Updated 4 months ago
- LLM training in simple, raw C/CUDA ☆24,492 Updated last month
- Inference Llama 2 in one file of pure C ☆17,482 Updated 3 months ago
- Blazingly fast LLM inference. ☆4,485 Updated this week
- LLM101n: Let's build a Storyteller ☆30,231 Updated 3 months ago
- Machine Learning Engineering Open Book ☆11,659 Updated last week
- NanoGPT (124M) in 5 minutes ☆1,269 Updated this week
- Material for gpu-mode lectures ☆3,028 Updated this week
- CoreNet: A library for training deep neural networks ☆6,984 Updated last month
- From the Tensor to Stable Diffusion, a rough outline for a 9 week course. ☆1,030 Updated 6 months ago
- Tile primitives for speedy kernels ☆1,661 Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,195 Updated 4 months ago
- A native PyTorch Library for large model training ☆2,635 Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API ☆10,520 Updated 3 months ago
- Puzzles for learning Triton ☆1,138 Updated this week
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆5,992 Updated this week
- OpenSource GPU, in Verilog, loosely based on RISC-V ISA ☆828 Updated 5 months ago
- Video+code lecture on building nanoGPT from scratch ☆3,617 Updated 3 months ago
- Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes ☆1,614 Updated 3 years ago
- A computer science textbook ☆3,431 Updated 3 months ago
- The n-gram Language Model ☆1,343 Updated 3 months ago
- The full minitorch student suite. ☆1,917 Updated 3 months ago
- Implementation for MatMul-free LM. ☆2,923 Updated 2 weeks ago
- The official PyTorch implementation of Google's Gemma models ☆5,292 Updated 3 months ago
- Llama 2 Everywhere (L2E) ☆1,512 Updated last month
- A nanoGPT pipeline packed in a spreadsheet ☆2,047 Updated 5 months ago
- A Python framework for high performance GPU simulation and graphics ☆4,270 Updated this week
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning ☆7,574 Updated this week