weishengying / cutlass_flash_atten_fp8

使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention
60Updated 7 months ago

Alternatives and similar repositories for cutlass_flash_atten_fp8:

Users that are interested in cutlass_flash_atten_fp8 are comparing it to the libraries listed below