End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
☆397Jan 8, 2026Updated 3 months ago
Alternatives and similar repositories for diffusers-torchao
Users that are interested in diffusers-torchao are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆426Jul 5, 2025Updated 10 months ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆284Oct 12, 2024Updated last year
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,306Mar 27, 2025Updated last year
- A general fine-tuning kit geared toward image/video/audio diffusion models.☆2,824Apr 27, 2026Updated last week
- PyTorch native quantization and sparsity for training and inference☆2,807Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Scalable and memory-optimized training of diffusion models☆1,359Apr 8, 2026Updated 3 weeks ago
- Model Compression Toolbox for Large Language Models and Diffusion Models☆779Aug 14, 2025Updated 8 months ago
- Faster generation with text-to-image diffusion models.☆232Jun 28, 2025Updated 10 months ago
- Accelerates Flux.1 image generation, just by using this node.☆140Dec 19, 2024Updated last year
- ☆192Jan 14, 2025Updated last year
- Training-free Regional Prompting for Diffusion Transformers 🔥☆696Nov 28, 2024Updated last year
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models☆3,826Mar 7, 2026Updated last month
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆214Sep 27, 2025Updated 7 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,317Jun 8, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism☆2,610Apr 27, 2026Updated last week
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- A unified inference and post-training framework for accelerated video generation.☆3,446Updated this week
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆574Sep 16, 2024Updated last year
- Making Flux go brrr on GPUs.☆166Jan 5, 2026Updated 4 months ago
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆963Jun 27, 2024Updated last year
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.☆991Feb 25, 2026Updated 2 months ago
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- [WIP] Better (FP8) attention for Hopper☆33Feb 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OneDiff: An out-of-the-box acceleration library for diffusion models.☆1,971Dec 4, 2025Updated 5 months ago
- ☆175Sep 17, 2025Updated 7 months ago
- A pytorch quantization backend for optimum☆1,038Apr 2, 2026Updated last month
- ☆2,240Nov 8, 2024Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆405Mar 19, 2025Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆418Feb 26, 2025Updated last year
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆662Mar 6, 2026Updated last month
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis☆1,316Mar 5, 2025Updated last year
- faster parallel inference of mochi-1 video generation model☆125Feb 25, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆663Mar 11, 2025Updated last year
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆516Dec 11, 2024Updated last year
- Cog inference for flux models☆369Jul 31, 2025Updated 9 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Triton kernels for Flux☆23Jul 7, 2025Updated 9 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆215Sep 27, 2025Updated 7 months ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆841Jan 7, 2026Updated 3 months ago