DefTruth / ffpa-attn-mma

📚[WIP] FFPA: Yet another Faster Flash Prefill Attention with O(1)🎉GPU SRAM complexity for headdim > 256, 1.8x~3x↑🎉faster vs SDPA EA.
44 · Updated this week

Alternatives and similar repositories for ffpa-attn-mma:

Users who are interested in ffpa-attn-mma are comparing it to the libraries listed below.