adam-maj / tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
☆7,340Updated 5 months ago
Alternatives and similar repositories for tiny-gpu:
Users that are interested in tiny-gpu are comparing it to the libraries listed below
- Solve puzzles. Learn CUDA.☆10,335Updated 4 months ago
- Material for gpu-mode lectures☆3,501Updated last week
- LLM training in simple, raw C/CUDA☆25,047Updated 3 months ago
- A lightweight library for portable low-level GPU computation using WebGPU.☆3,791Updated 2 weeks ago
- Implementation for MatMul-free LM.☆2,941Updated 2 months ago
- Machine Learning Engineering Open Book☆12,353Updated this week
- Tile primitives for speedy kernels☆1,923Updated this week
- Solve puzzles. Improve your pytorch.☆3,359Updated 6 months ago
- GPU programming related news and material links☆1,312Updated last week
- Open-source high-performance RISC-V processor☆5,921Updated this week
- A PyTorch native library for large model training☆3,091Updated this week
- Development repository for the Triton language and compiler☆14,042Updated this week
- A nanoGPT pipeline packed in a spreadsheet☆2,059Updated 7 months ago
- Puzzles for learning Triton☆1,300Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,749Updated last month
- OpenSource GPU, in Verilog, loosely based on RISC-V ISA☆877Updated last month
- RISC-V Linux SoC, marchID: 0x2b☆749Updated this week
- ☆1,207Updated 3 months ago
- CUDA Templates for Linear Algebra Subroutines☆5,999Updated last week
- A self-paced course to learn Rust, one exercise at a time.☆6,956Updated last week
- A list of awesome compiler projects and papers for tensor computation and deep learning.☆2,461Updated 2 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,462Updated this week
- The book "Performance Analysis and Tuning on Modern CPU"☆2,681Updated 3 weeks ago
- A Python framework for high performance GPU simulation and graphics☆4,450Updated this week
- ☆1,012Updated last month
- On-device AI across mobile, embedded and edge for PyTorch☆2,407Updated this week
- Inference Llama 2 in one file of pure C☆17,858Updated 5 months ago
- The full minitorch student suite.☆1,985Updated 5 months ago
- tiniest x86-64-linux emulator☆7,050Updated 3 months ago
- Blazingly fast LLM inference.☆4,826Updated this week