chengzeyi / stable-fast
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
☆1,244Updated this week
Alternatives and similar repositories for stable-fast:
Users that are interested in stable-fast are comparing it to the libraries listed below
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆871Updated 9 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆670Updated 3 months ago
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models☆1,033Updated this week
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism☆1,663Updated last week
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,781Updated 4 months ago
- Tiny AutoEncoder for Stable Diffusion☆664Updated last week
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty…☆556Updated last year
- Model Compression Toolbox for Large Language Models and Diffusion Models☆394Updated last month
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆334Updated last month
- Segmind Distilled diffusion☆593Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,176Updated 8 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,013Updated 4 months ago
- The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥☆798Updated 11 months ago
- Context parallel attention that accelerates DiT model inference with dynamic caching☆228Updated this week
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL☆1,095Updated last year
- FastVideo is a lightweight framework for accelerating large video diffusion models.☆1,264Updated this week
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆255Updated 5 months ago
- Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossi…☆1,185Updated this week
- Speed up Stable Diffusion with this one simple trick!☆1,331Updated last year
- ☆426Updated last year
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆808Updated 3 months ago
- A prompting enhancement library for transformers-type text embedding systems☆565Updated 2 months ago
- ☆270Updated 2 months ago
- ☆591Updated 5 months ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆569Updated 2 weeks ago
- VideoSys: An easy and efficient system for video generation☆1,944Updated 2 weeks ago
- Faster generation with text-to-image diffusion models.☆211Updated 5 months ago
- ☆432Updated 4 months ago
- [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.☆347Updated last year
- [CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model☆765Updated 7 months ago