DefTruth / ffpa-attn-mma

📚FFPA(Split-D): Yet another Faster Flash Prefill Attention with O(1) GPU SRAM complexity for headdim > 256, ~2x↑🎉vs SDPA EA.
147Updated this week

Alternatives and similar repositories for ffpa-attn-mma:

Users that are interested in ffpa-attn-mma are comparing it to the libraries listed below