aredden / flux-fp8-apiLinks
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
☆286Updated last year
Alternatives and similar repositories for flux-fp8-api
Users that are interested in flux-fp8-api are comparing it to the libraries listed below
Sorting:
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆388Updated 6 months ago
- ☆282Updated 11 months ago
- Generate long weighted prompt embeddings for Stable Diffusion☆146Updated 8 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆403Updated 5 months ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆643Updated 9 months ago
- ☆164Updated this week
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆415Updated 9 months ago
- ☆126Updated 9 months ago
- ☆442Updated last year
- Faster generation with text-to-image diffusion models.☆231Updated 5 months ago
- Cog inference for flux models☆367Updated 4 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆210Updated 2 months ago
- Official Repository of the paper "Trajectory Consistency Distillation"☆356Updated last year
- The best OSS video generation models☆135Updated last year
- Various training scripts used to train bigasp☆110Updated 4 months ago
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral)☆126Updated last year
- AuraSR: GAN-based Super-Resolution for real-world☆511Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆403Updated 9 months ago
- Accelerates Flux.1 image generation, just by using this node.☆140Updated last year
- ☆160Updated 10 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆255Updated 11 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆507Updated last year
- faster parallel inference of mochi-1 video generation model☆126Updated 9 months ago
- ☆447Updated last year
- IP Adapter Instruct☆211Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆179Updated last year
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated last year
- Implicit Style-Content Separation using B-LoRA☆394Updated last year
- InstantID-ROME: Improved Identity-Preserving Generation in Seconds 🔥☆235Updated last year
- Qwen-Image text to image lora trainer☆652Updated last week