jundaf2 / CUDA-INT8-GEMM

CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API
27Updated last year

Alternatives and similar repositories for CUDA-INT8-GEMM:

Users that are interested in CUDA-INT8-GEMM are comparing it to the libraries listed below