jundaf2 / CUDA-INT8-GEMMView on GitHub
CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API
35Sep 15, 2023Updated 2 years ago

Alternatives and similar repositories for CUDA-INT8-GEMM

Users that are interested in CUDA-INT8-GEMM are comparing it to the libraries listed below

Sorting:

Are these results useful?