weishengying / cutlass_flash_atten_fp8
An FP8 flash attention implementation for the Ada architecture, built with the CUTLASS library
81 · Aug 12, 2024 · Updated last year

Alternatives and similar repositories for cutlass_flash_atten_fp8

Users that are interested in cutlass_flash_atten_fp8 are comparing it to the libraries listed below.
