NonvolatileMemory / flash_attn_gqaView on GitHub
triton ver of gqa flash attn, based on the tutorial
12Aug 4, 2024Updated last year

Alternatives and similar repositories for flash_attn_gqa

Users that are interested in flash_attn_gqa are comparing it to the libraries listed below

Sorting:

Are these results useful?