mkauers / matrix-multiplicationLinks
Matrix multiplication schemes
☆195Updated 2 months ago
Alternatives and similar repositories for matrix-multiplication
Users that are interested in matrix-multiplication are comparing it to the libraries listed below
Sorting:
- Quantum computing without the linear algebra☆71Updated last month
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- Visualization of cache-optimized matrix multiplication☆152Updated 4 months ago
- The Finite Field Assembly Programming Language☆36Updated last month
- Custom PTX Instruction Benchmark☆126Updated 4 months ago
- parallelized hyperdimensional tictactoe☆118Updated 10 months ago
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks☆88Updated 2 months ago
- RDNA3 emulator☆54Updated 3 months ago
- Alex Krizhevsky's original code from Google Code☆194Updated 9 years ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated 11 months ago
- Tensor library with autograd using only Rust's standard library☆68Updated last year
- Learning about CUDA by writing PTX code.☆133Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆350Updated 2 months ago
- Learn GPU Programming in Mojo🔥 by Solving Puzzles☆87Updated last week
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆120Updated 6 months ago
- A massively parallel, optimal functional runtime in Rust☆31Updated 11 months ago
- FP4 MAC Array☆19Updated last year
- a categorical deep learning compiler☆203Updated 4 months ago
- An implementation of delta-iris in tinygrad☆72Updated 10 months ago
- ☆96Updated 7 months ago
- ☆30Updated 6 months ago
- Gradient descent is cool and all, but what if we could delete it?☆104Updated this week
- LLM training in simple, raw C/CUDA☆99Updated last year
- ☆70Updated last month
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆196Updated last month
- tiny code to access tenstorrent blackhole☆55Updated last month
- Competitive GPU kernel optimization platform.☆86Updated this week
- ☆138Updated last year
- tenstorrent kernel from twitch☆28Updated last year