Fast and memory-efficient exact attention
☆918Dec 9, 2025Updated 4 months ago
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fork of the Triton language and compiler for Windows support and easy installation☆1,905Feb 18, 2026Updated 2 months ago
- Fast and memory-efficient exact attention - Windows wheels☆36Apr 30, 2025Updated 11 months ago
- Fork of SageAttention for Windows wheels and easy installation☆767Mar 25, 2026Updated 3 weeks ago
- Windows compile of bitsandbytes for use in text-generation-webui.☆361Nov 18, 2023Updated 2 years ago
- Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossi…☆132Mar 24, 2026Updated 3 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Fast and memory-efficient exact attention☆23,344Updated this week
- Fast and memory-efficient exact attention☆50Mar 24, 2026Updated 3 weeks ago
- ☆195Jul 31, 2024Updated last year
- Pre-compiled Python whl for Flash-attention, SageAttention, NATTEN, xFormer etc☆594Apr 1, 2026Updated 2 weeks ago
- A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.☆2,982Jan 30, 2026Updated 2 months ago
- ☆478Oct 30, 2024Updated last year
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models☆3,799Mar 7, 2026Updated last month
- GGUF Quantization support for native ComfyUI models☆3,478Jan 12, 2026Updated 3 months ago
- for tile the image for advanced control or modification☆972Jan 8, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Support for miscellaneous image models. Currently supports: DiT, PixArt, T5 and a few custom VAEs☆18May 20, 2024Updated last year
- Diffusers wrapper to run Kwai-Kolors model☆598Oct 18, 2024Updated last year
- Support for miscellaneous image models. Currently supports: DiT, PixArt, HunYuanDiT, MiaoBi, and a few VAEs.☆534Dec 17, 2024Updated last year
- ☆109Dec 20, 2024Updated last year
- [ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-t…☆3,296Jan 17, 2026Updated 3 months ago
- ☆66Mar 16, 2026Updated last month
- ComfyUI Node☆716Jun 18, 2025Updated 10 months ago
- Inference Microsoft Florence2 VLM☆1,660Apr 8, 2026Updated last week
- ☆186Dec 24, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.☆1,224Aug 2, 2025Updated 8 months ago
- ComfyUI BrushNet nodes☆942Mar 31, 2025Updated last year
- ☆239May 22, 2024Updated last year
- Recommended based on comfyui node pictures:Joy_caption + MiniCPMv2_6-prompt-generator + florence2☆623Feb 6, 2025Updated last year
- This tool allows you to process multiple images simultaneously, including removing metadata and alpha channels from the images. / 本ツールは、複…☆10Dec 20, 2023Updated 2 years ago
- ☆162Mar 19, 2025Updated last year
- SUPIR upscaling wrapper for ComfyUI☆2,255Mar 15, 2026Updated last month
- ComfyUI Plugin of Nunchaku☆2,844Feb 19, 2026Updated 2 months ago
- a set of utils for comfyui lora operation☆30Jan 6, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆6,305Feb 22, 2026Updated last month
- ☆517Apr 26, 2025Updated 11 months ago
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton☆40May 18, 2025Updated 11 months ago
- Generate detailed image descriptions and analysis using Molmo models in ComfyUI.☆140Oct 14, 2024Updated last year
- ComfyUI nodes to use segment-anything-2☆1,181Sep 28, 2025Updated 6 months ago
- nunchaku0.3.0.dev2的各种轮子列表☆41May 20, 2025Updated 10 months ago
- Accelerate inference in Flux and Sana for ComfyUI.☆222Mar 13, 2025Updated last year