Fast and memory-efficient exact attention
☆933Dec 9, 2025Updated 5 months ago
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fork of the Triton language and compiler for Windows support and easy installation☆1,921Feb 18, 2026Updated 2 months ago
- Fast and memory-efficient exact attention - Windows wheels☆36Apr 30, 2025Updated last year
- Fork of SageAttention for Windows wheels and easy installation☆793Mar 25, 2026Updated last month
- Windows compile of bitsandbytes for use in text-generation-webui.☆361Nov 18, 2023Updated 2 years ago
- Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossi…☆138Mar 24, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast and memory-efficient exact attention☆23,628May 3, 2026Updated last week
- Fast and memory-efficient exact attention☆53Mar 24, 2026Updated last month
- ☆195Jul 31, 2024Updated last year
- Pre-compiled Python whl for Flash-attention, SageAttention, NATTEN, xFormer etc☆617Apr 1, 2026Updated last month
- A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.☆3,012Jan 30, 2026Updated 3 months ago
- ☆478Oct 30, 2024Updated last year
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models☆3,832Mar 7, 2026Updated 2 months ago
- GGUF Quantization support for native ComfyUI models☆3,582Jan 12, 2026Updated 3 months ago
- for tile the image for advanced control or modification☆977Jan 8, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Support for miscellaneous image models. Currently supports: DiT, PixArt, T5 and a few custom VAEs☆18May 20, 2024Updated last year
- Diffusers wrapper to run Kwai-Kolors model☆598Oct 18, 2024Updated last year
- Support for miscellaneous image models. Currently supports: DiT, PixArt, HunYuanDiT, MiaoBi, and a few VAEs.☆534Dec 17, 2024Updated last year
- ☆111Dec 20, 2024Updated last year
- [ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-t…☆3,342Jan 17, 2026Updated 3 months ago
- ☆66Mar 16, 2026Updated last month
- ComfyUI Node☆718Jun 18, 2025Updated 10 months ago
- Inference Microsoft Florence2 VLM☆1,679Apr 18, 2026Updated 3 weeks ago
- ☆187Dec 24, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ComfyUI BrushNet nodes☆943Mar 31, 2025Updated last year
- ☆238May 22, 2024Updated last year
- https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.☆1,227Aug 2, 2025Updated 9 months ago
- Recommended based on comfyui node pictures:Joy_caption + MiniCPMv2_6-prompt-generator + florence2☆624Feb 6, 2025Updated last year
- This tool allows you to process multiple images simultaneously, including removing metadata and alpha channels from the images. / 本ツールは、複…☆10Dec 20, 2023Updated 2 years ago
- ☆163Mar 19, 2025Updated last year
- SUPIR upscaling wrapper for ComfyUI☆2,265Apr 29, 2026Updated last week
- ComfyUI Plugin of Nunchaku☆2,868Feb 19, 2026Updated 2 months ago
- a set of utils for comfyui lora operation☆31Apr 14, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆6,382Updated this week
- ☆519Apr 26, 2025Updated last year
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton☆42May 18, 2025Updated 11 months ago
- ComfyUI nodes to use segment-anything-2☆1,188Sep 28, 2025Updated 7 months ago
- Generate detailed image descriptions and analysis using Molmo models in ComfyUI.☆139Oct 14, 2024Updated last year
- Accelerate inference in Flux and Sana for ComfyUI.☆222Mar 13, 2025Updated last year
- ComfyUI-UniversalBlockSwap☆51Sep 18, 2025Updated 7 months ago