Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
☆132 · Mar 24, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for SageAttention-for-windows
Users that are interested in SageAttention-for-windows are comparing it to the libraries listed below.
- Fork of SageAttention for Windows wheels and easy installation · ☆767 · Mar 25, 2026 · Updated 3 weeks ago
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton · ☆40 · May 18, 2025 · Updated 11 months ago
- A simple aesthetic scorer + pruner + website you can run to view the results from the scoring with · ☆16 · Jun 3, 2024 · Updated last year
- Privacy Covers for Load image, preview image and Save image nodes in comfyUI