mkauers / matrix-multiplication
Matrix multiplication schemes
☆206 Updated last month
Alternatives and similar repositories for matrix-multiplication
Users who are interested in matrix-multiplication are comparing it to the libraries listed below.
- Quantum computing without the linear algebra ☆77 Updated 2 weeks ago
- ctypes wrappers for HIP, CUDA, and OpenCL ☆130 Updated last year
- The Quasi Quantum Assembly Programming Language ☆36 Updated last month
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated last year
- Visualization of cache-optimized matrix multiplication ☆157 Updated 9 months ago
- Exocompilation for productive programming of hardware accelerators ☆693 Updated last week
- parallelized hyperdimensional tictactoe ☆126 Updated last year
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks ☆98 Updated 3 weeks ago
- GPU-accelerated compiler ☆364 Updated last year
- tiny code to access tenstorrent blackhole ☆61 Updated 6 months ago
- RDNA3 emulator ☆55 Updated 8 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆370 Updated 8 months ago
- Cuq: A MIR-to-Coq Framework Targeting PTX for Formal Semantics and Verified Translation of Rust GPU Kernels ☆116 Updated 3 weeks ago
- HVM3 ☆277 Updated 2 months ago
- Alex Krizhevsky's original code from Google Code ☆197 Updated 9 years ago
- Train neural networks that distill into logic circuits, using JAX ☆63 Updated 6 months ago
- A massively parallel, optimal functional runtime in Rust ☆31 Updated last year
- ☆109 Updated last year
- Learn GPU Programming in Mojo🔥 by Solving Puzzles ☆254 Updated last week
- Tensor library with autograd using only Rust's standard library ☆70 Updated last year
- a categorical deep learning compiler ☆206 Updated 2 months ago
- The Cosmos numerical relativity code (with unstructured AMR) ☆20 Updated last year
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA ☆174 Updated 11 months ago
- noise_step: Training in 1.58b With No Gradient Memory ☆221 Updated 11 months ago
- A collection of study materials for AI compilers and systems. ☆46 Updated last month
- Competitive GPU kernel optimization platform. ☆142 Updated 2 weeks ago
- Learning about CUDA by writing PTX code. ☆150 Updated last year
- Code for the manim-generated scenes used in welch labs videos ☆191 Updated this week
- ☆138 Updated last year
- ☆30 Updated 11 months ago