geohot / gpunoobLinks
Noob Lessons from Stream about how GPUs work
☆133Updated 7 months ago
Alternatives and similar repositories for gpunoob
Users that are interested in gpunoob are comparing it to the libraries listed below
Sorting:
- parallelized hyperdimensional tictactoe☆126Updated last year
- Solve puzzles to improve your tinygrad skills!☆164Updated 2 months ago
- ☆97Updated last week
- An implementation of delta-iris in tinygrad☆72Updated last year
- could we make an ml stack in 100,000 lines of code?☆46Updated last year
- Tutorials on tinygrad☆444Updated 2 months ago
- Tensor library with autograd using only Rust's standard library☆70Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆277Updated last year
- The simplest way to run LLMs anywhere☆106Updated last year
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆226Updated last year
- Can RL solve simple problems?☆54Updated last year
- Learnings and programs related to CUDA☆428Updated 5 months ago
- Competitive GPU kernel optimization platform.☆141Updated last week
- work @ comma.ai☆174Updated last year
- Can you design a controller to steer a simulated car?☆329Updated 4 months ago
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale☆135Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch.☆661Updated this week
- If tinygrad wasn't small enough for you...☆759Updated last year
- Learn GPU Programming in Mojo🔥 by Solving Puzzles☆249Updated last week
- GPT-2 in C☆77Updated 11 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆602Updated 5 months ago
- RDNA3 emulator☆55Updated 7 months ago
- Quantized LLM training in pure CUDA/C++.☆221Updated this week
- my little linear algebra library☆44Updated last year
- A light tensor library in zig.☆78Updated 10 months ago
- SIMD quantization kernels☆93Updated 3 months ago
- An implement of deep learning framework and models in C☆48Updated 8 months ago
- Learning about CUDA by writing PTX code.☆149Updated last year
- pytorch from scratch in pure C/CUDA and python☆41Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆368Updated 7 months ago