mkauers / matrix-multiplicationLinks
Matrix multiplication schemes
☆200Updated 5 months ago
Alternatives and similar repositories for matrix-multiplication
Users that are interested in matrix-multiplication are comparing it to the libraries listed below
Sorting:
- Quantum computing without the linear algebra☆76Updated 4 months ago
- The Finite Field Assembly Programming Language☆36Updated 5 months ago
- parallelized hyperdimensional tictactoe☆125Updated last year
- Visualization of cache-optimized matrix multiplication☆156Updated 7 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- RDNA3 emulator☆54Updated 6 months ago
- GPU-accelerated compiler☆353Updated last year
- Learning about CUDA by writing PTX code.☆143Updated last year
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks☆90Updated last week
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- ☆73Updated 3 weeks ago
- Exocompilation for productive programming of hardware accelerators☆672Updated this week
- Learn GPU Programming in Mojo🔥 by Solving Puzzles☆150Updated last week
- tiny code to access tenstorrent blackhole☆59Updated 4 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆364Updated 5 months ago
- Competitive GPU kernel optimization platform.☆107Updated last week
- HVM3☆266Updated 2 weeks ago
- Alex Krizhevsky's original code from Google Code☆199Updated 9 years ago
- A massively parallel, optimal functional runtime in Rust☆31Updated last year
- Gradient descent is cool and all, but what if we could delete it?☆104Updated last month
- ☆104Updated 10 months ago
- Experimental GPU language with meta-programming☆23Updated last year
- Tensor library with autograd using only Rust's standard library☆69Updated last year
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆151Updated 10 months ago
- tenstorrent kernel from twitch☆28Updated last year
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆185Updated last year
- a categorical deep learning compiler☆204Updated 2 weeks ago
- A tiny deep learning library written in Java☆26Updated 2 years ago
- An attempt at safe imperative GPU programming.☆56Updated 2 months ago
- Code for the manim-generated scenes used in welch labs videos☆155Updated this week