Flux diffusion model implementation that uses quantized fp8 matmul, while the remaining layers use faster half-precision accumulation; this is ~2x faster on consumer devices.
☆284 · Oct 12, 2024 · Updated last year
Alternatives and similar repositories for flux-fp8-api
Users who are interested in flux-fp8-api are comparing it to the libraries listed below:
- Cog inference for flux models ☆369 · Jul 31, 2025 · Updated 7 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆395 · Jan 8, 2026 · Updated last month
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu ☆78 · Dec 3, 2024 · Updated last year
- A general fine-tuning kit geared toward image/video/audio diffusion models. ☆2,766 · Feb 21, 2026 · Updated last week
- A unified benchmarking framework for generative styling models in PyTorch ☆14 · Oct 27, 2024 · Updated last year
- ☆79 · Dec 27, 2024 · Updated last year
- ☆33 · Nov 4, 2024 · Updated last year
- ☆16 · Apr 23, 2024 · Updated last year
- ☆20 · Jun 26, 2024 · Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025) ☆52 · Jan 14, 2026 · Updated last month
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models ☆3,703 · Feb 14, 2026 · Updated 2 weeks ago
- Accelerates Flux.1 image generation, just by using this node. ☆140 · Dec 19, 2024 · Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆24 · Jun 24, 2024 · Updated last year
- Official repository for VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization paper ☆34 · Sep 17, 2024 · Updated last year
- OneDiff: An out-of-the-box acceleration library for diffusion models. ☆1,970 · Dec 4, 2025 · Updated 2 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆212 · Sep 27, 2025 · Updated 5 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆215 · Sep 27, 2025 · Updated 5 months ago
- Text and image to video generation: Kandinsky 4.0 (2024) ☆150 · Dec 17, 2024 · Updated last year
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs. ☆1,301 · Mar 27, 2025 · Updated 11 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching ☆424 · Jul 5, 2025 · Updated 7 months ago
- ☆48 · Feb 23, 2025 · Updated last year
- Writing FLUX in Triton ☆42 · Sep 22, 2024 · Updated last year
- Dungeon procedural generator similar to whatabou's "One Page Dungeon" ☆50 · Jan 4, 2026 · Updated last month
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025 ☆469 · Mar 19, 2025 · Updated 11 months ago
- ☆2,230 · Nov 8, 2024 · Updated last year
- LayerDiffuse in pure diffusers without any GUI ☆415 · Jun 16, 2024 · Updated last year
- Implicit Style-Content Separation using B-LoRA ☆396 · Nov 14, 2024 · Updated last year
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier ☆18 · Dec 18, 2024 · Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control" ☆403 · Mar 19, 2025 · Updated 11 months ago
- Concept Sliders for Precise Control of Diffusion Models ☆1,129 · Jun 20, 2025 · Updated 8 months ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ☆1,897 · Oct 31, 2024 · Updated last year
- The ultimate training toolkit for finetuning diffusion models ☆9,575 · Updated this week
- Model Compression Toolbox for Large Language Models and Diffusion Models ☆759 · Aug 14, 2025 · Updated 6 months ago
- ☆46 · Nov 20, 2025 · Updated 3 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM! ☆512 · Dec 11, 2024 · Updated last year
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation ☆566 · Sep 16, 2024 · Updated last year
- [ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation" ☆262 · Jan 12, 2026 · Updated last month
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,252 · Feb 16, 2025 · Updated last year
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models ☆722 · Dec 2, 2024 · Updated last year