geohot / gpunoob
Noob Lessons from Stream about how GPUs work
☆135 Updated 8 months ago
Alternatives and similar repositories for gpunoob
Users that are interested in gpunoob are comparing it to the libraries listed below
- parallelized hyperdimensional tictactoe ☆126 Updated last year
- ☆97 Updated last week
- Solve puzzles to improve your tinygrad skills! ☆174 Updated 2 months ago
- An implementation of delta-iris in tinygrad ☆72 Updated last year
- Tutorials on tinygrad ☆448 Updated 2 months ago
- could we make an ml stack in 100,000 lines of code? ☆46 Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆276 Updated last year
- Can RL solve simple problems? ☆54 Updated last year
- Tensor library with autograd using only Rust's standard library ☆71 Updated last year
- a tiny multidimensional array implementation in C similar to numpy, but only one file. ☆225 Updated last year
- The simplest way to run LLMs anywhere ☆106 Updated last year
- Learnings and programs related to CUDA ☆432 Updated 6 months ago
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale ☆137 Updated last month
- Learning about CUDA by writing PTX code. ☆151 Updated last year
- Can you design a controller to steer a simulated car? ☆336 Updated 5 months ago
- GPT-2 in C ☆79 Updated last year
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA ☆175 Updated 11 months ago
- pytorch from scratch in pure C/CUDA and python ☆41 Updated last year
- An implement of deep learning framework and models in C ☆48 Updated 9 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆664 Updated last week
- my little linear algebra library ☆44 Updated last year
- Competitive GPU kernel optimization platform. ☆144 Updated this week
- FP4 MAC Array ☆19 Updated last year
- Learn GPU Programming in Mojo🔥 by Solving Puzzles ☆266 Updated 2 weeks ago
- Simple MPI implementation for prototyping or learning ☆297 Updated 5 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL ☆130 Updated last year
- SIMD quantization kernels ☆93 Updated 4 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆372 Updated 8 months ago
- If tinygrad wasn't small enough for you... ☆761 Updated last year
- Alex Krizhevsky's original code from Google Code ☆197 Updated 9 years ago