luongthecong123 / fp8-quant-matmulLinks

Block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. Additionally, this repo includes codes for quantizing Pytorch bf16 matmul with fp8.
15Updated this week

Alternatives and similar repositories for fp8-quant-matmul

Users that are interested in fp8-quant-matmul are comparing it to the libraries listed below

Sorting: