enp1s0 / ozIMMU
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
☆93 · Updated 8 months ago
Alternatives and similar repositories for ozIMMU
Users that are interested in ozIMMU are comparing it to the libraries listed below
- An extension library of WMMA API (Tensor Core API) · ☆109 · Updated last year
- ☆50 · Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm · ☆210 · Updated last month
- AMD’s C++ library for accelerating tensor primitives · ☆46 · Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators · ☆122 · Updated 3 weeks ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial · ☆332 · Updated 2 weeks ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. · ☆32 · Updated 8 months ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API · ☆90 · Updated 2 years ago
- ☆52 · Updated 7 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo