adam-maj / tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
☆11,290 · Updated last year
Alternatives and similar repositories for tiny-gpu
Users who are interested in tiny-gpu are comparing it to the libraries listed below.
- LLM training in simple, raw C/CUDA ☆28,763 · Updated 7 months ago
- Solve puzzles. Learn CUDA. ☆11,932 · Updated last year
- A lightweight library for portable low-level GPU computation using WebGPU. ☆3,941 · Updated 4 months ago
- OpenSource GPU, in Verilog, loosely based on RISC-V ISA ☆1,239 · Updated last year
- A PyTorch native platform for training generative AI models ☆5,045 · Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,293 · Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,183 · Updated 5 months ago
- A Python framework for accelerated simulation, data generation and spatial computing. ☆6,191 · Updated this week
- Material for gpu-mode lectures ☆5,679 · Updated last week
- A machine learning compiler for GPUs, CPUs, and ML accelerators ☆3,956 · Updated this week
- Machine Learning Engineering Open Book ☆16,586 · Updated 2 weeks ago
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1 ☆1,161 · Updated 5 months ago
- ☆1,885 · Updated this week
- Puzzles for learning Triton ☆2,283 · Updated last year
- llama3 implementation one matrix multiplication at a time ☆15,241 · Updated last year
- Tile primitives for speedy kernels ☆3,120 · Updated this week
- Solve puzzles. Improve your pytorch. ☆3,912 · Updated last year
- ☆4,112 · Updated last year
- ☆1,585 · Updated this week
- Inference Llama 2 in one file of pure C ☆19,146 · Updated last year
- Open-source high-performance RISC-V processor ☆6,867 · Updated this week
- Video+code lecture on building nanoGPT from scratch ☆4,719 · Updated last year
- GPU programming related news and material links ☆1,955 · Updated 4 months ago
- An ML Systems Onboarding list ☆981 · Updated last year
- CoreNet: A library for training deep neural networks ☆7,016 · Updated 4 months ago
- A deep-dive on the entire history of deep-learning ☆1,521 · Updated last year
- NanoGPT (124M) in 2 minutes ☆4,589 · Updated last week
- Development repository for the Triton language and compiler ☆18,319 · Updated last week
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course. ☆1,074 · Updated 4 months ago
- Tensor library for machine learning ☆13,923 · Updated this week