mkauers / matrix-multiplicationLinks
Matrix multiplication schemes
☆206Updated 3 months ago
Alternatives and similar repositories for matrix-multiplication
Users that are interested in matrix-multiplication are comparing it to the libraries listed below
Sorting:
- Quantum computing without the linear algebra☆78Updated last month
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- The Quasi Quantum Assembly Programming Language☆36Updated 2 months ago
- Visualization of cache-optimized matrix multiplication☆157Updated 10 months ago
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks☆98Updated 2 weeks ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆376Updated 9 months ago
- parallelized hyperdimensional tictactoe☆126Updated last year
- RDNA3 emulator☆55Updated 9 months ago
- Exocompilation for productive programming of hardware accelerators☆703Updated last week
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- tiny code to access tenstorrent blackhole☆61Updated 8 months ago
- ☆30Updated last year
- Cuq: A MIR-to-Coq Framework Targeting PTX for Formal Semantics and Verified Translation of Rust GPU Kernels☆124Updated last month
- Tensor library with autograd using only Rust's standard library☆71Updated last year
- Learning about CUDA by writing PTX code.☆151Updated last year
- A massively parallel, optimal functional runtime in Rust☆31Updated last year
- Tensor library & inference framework for machine learning☆117Updated 3 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆194Updated last year
- Competitive GPU kernel optimization platform.☆146Updated last week
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA