aredden / flux-fp8-api
Flux diffusion model implementation using quantized fp8 matmul, with the remaining layers using faster half-precision accumulate, which is ~2x faster on consumer devices.
☆255 Updated 5 months ago
Alternatives and similar repositories for flux-fp8-api:
Users who are interested in flux-fp8-api are comparing it to the libraries listed below:
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆334 Updated last month
- Context parallel attention that accelerates DiT model inference with dynamic caching ☆228 Updated this week
- ☆270 Updated 2 months ago
- ☆124 Updated 3 weeks ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆192 Updated last month
- [NeurIPS 2024] Boosting the performance of consistency models with PCM! ☆447 Updated 3 months ago
- ☆426 Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free! ☆391 Updated last month
- faster parallel inference of mochi-1 video generation model ☆112 Updated last month
- Cog inference for flux models ☆332 Updated last week
- Generate long weighted prompt embeddings for Stable Diffusion ☆110 Updated 6 months ago
- Faster generation with text-to-image diffusion models. ☆211 Updated 5 months ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral) ☆569 Updated 2 weeks ago
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral) ☆125 Updated 7 months ago
- IP Adapter Instruct ☆202 Updated 7 months ago
- Training-free Regional Prompting for Diffusion Transformers 🔥 ☆591 Updated 4 months ago
- Accelerates Flux.1 image generation, just by using this node. ☆127 Updated 3 months ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL. ☆174 Updated 11 months ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control" ☆372 Updated last week
- Various training scripts used to train bigasp ☆77 Updated 5 months ago
- Enhance-A-Video: Better Generated Video for Free ☆483 Updated last week
- The best OSS video generation models ☆132 Updated 5 months ago
- Implicit Style-Content Separation using B-LoRA ☆358 Updated 4 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality ☆205 Updated 3 months ago
- ☆430 Updated last year
- ☆126 Updated 5 months ago
- InstantID-ROME: Improved Identity-Preserving Generation in Seconds 🔥 ☆221 Updated 10 months ago
- ☆449 Updated 4 months ago
- A set of ComfyUI nodes providing additional control for the LTX Video model ☆472 Updated 3 weeks ago
- ☆144 Updated last month