sdbds / SageAttention-for-windows

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.
12Updated last week

Alternatives and similar repositories for SageAttention-for-windows:

Users that are interested in SageAttention-for-windows are comparing it to the libraries listed below