sayakpaul/diffusers-torchao

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sayakpaul/diffusers-torchao)

sayakpaul / diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

☆399

Alternatives and similar repositories for diffusers-torchao

Users that are interested in diffusers-torchao are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chengzeyi / ParaAttention
View on GitHub
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆427Jul 5, 2025Updated last year
aredden / flux-fp8-api
View on GitHub
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…
☆287Oct 12, 2024Updated last year
chengzeyi / stable-fast
View on GitHub
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
☆1,304Mar 27, 2025Updated last year
bghira / SimpleTuner
View on GitHub
A general fine-tuning kit geared toward image/video/audio diffusion models.
☆2,885Updated this week
huggingface / flux-fast
View on GitHub
Making Flux go brrr on GPUs.
☆171Jan 5, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pytorch / ao
View on GitHub
PyTorch native quantization and sparsity for training and inference
☆2,914Updated this week
huggingface / finetrainers
View on GitHub
Scalable and memory-optimized training of diffusion models
☆1,355May 26, 2026Updated 2 months ago
nunchaku-ai / deepcompressor
View on GitHub
Model Compression Toolbox for Large Language Models and Diffusion Models
☆795Aug 14, 2025Updated 11 months ago
discus0434 / comfyui-flux-accelerator
View on GitHub
Accelerates Flux.1 image generation, just by using this node.
☆141Dec 19, 2024Updated last year
instantX-research / Regional-Prompting-FLUX
View on GitHub
Training-free Regional Prompting for Diffusion Transformers 🔥
☆696Nov 28, 2024Updated last year
thu-nics / DiTFastAttn
View on GitHub
☆192Jan 14, 2025Updated last year
huggingface / diffusion-fast
View on GitHub
Faster generation with text-to-image diffusion models.
☆234Jun 28, 2025Updated last year
nunchaku-ai / nunchaku
View on GitHub
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
☆3,915Mar 7, 2026Updated 4 months ago
czg1225 / AsyncDiff
View on GitHub
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
☆215Sep 27, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mit-han-lab / radial-attention
View on GitHub
[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation
☆604Nov 11, 2025Updated 8 months ago
xdit-project / xDiT
View on GitHub
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
☆2,662Jul 14, 2026Updated last week
fofr / cog-flux-layers-explorer
View on GitHub
Explore how Flux Dev responds when you change the strengths of layers in the model.
☆21Sep 20, 2024Updated last year
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,879Updated this week
Vchitect / VEnhancer
View on GitHub
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
☆576Sep 16, 2024Updated last year
ali-vilab / TeaCache
View on GitHub
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
☆1,357Jun 8, 2025Updated last year
chengzeyi / piflux
View on GitHub
(WIP) Parallel inference for black-forest-labs' FLUX model.
☆19Nov 18, 2024Updated last year
horseee / DeepCache
View on GitHub
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
☆970Jun 27, 2024Updated 2 years ago
thu-ml / SpargeAttn
View on GitHub
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆1,019Feb 25, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
WaveSpeedAI / QuantumAttention
View on GitHub
[WIP] Better (FP8) attention for Hopper
☆33Feb 24, 2025Updated last year
siliconflow / onediff
View on GitHub
OneDiff: An out-of-the-box acceleration library for diffusion models.
☆1,964Dec 4, 2025Updated 7 months ago
huggingface / optimum-quanto
View on GitHub
A pytorch quantization backend for optimum
☆1,048Updated this week
vipshop / cache-dit
View on GitHub
A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.
☆1,239Jul 15, 2026Updated last week
junhahyung / STGuidance
View on GitHub
☆179Sep 17, 2025Updated 10 months ago
tianweiy / DMD2
View on GitHub
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,410Mar 5, 2025Updated last year
XLabs-AI / x-flux
View on GitHub
☆2,232Nov 8, 2024Updated last year
HaozheLiu-ST / T-GATE
View on GitHub
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
☆418Feb 26, 2025Updated last year
a-r-r-o-w / productionizing-diffusion
View on GitHub
Optimizing diffusion for production-ready speeds
☆40Jan 10, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
google / RB-Modulation
View on GitHub
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
☆404Mar 19, 2025Updated last year
xdit-project / mochi-xdit
View on GitHub
faster parallel inference of mochi-1 video generation model
☆123Feb 25, 2025Updated last year
replicate / cog-flux
View on GitHub
Cog inference for flux models
☆371Jul 31, 2025Updated 11 months ago
gojasper / flash-diffusion
View on GitHub
Flash Diffusion — accelerating conditional diffusion models (AAAI 2025 Oral)
☆662Mar 11, 2025Updated last year
G-U-N / Phased-Consistency-Model
View on GitHub
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
☆520Dec 11, 2024Updated last year
svg-project / Sparse-VideoGen
View on GitHub
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆695Jul 4, 2026Updated 3 weeks ago
ai-compiler-study / triton-kernels
View on GitHub
Triton kernels for Flux
☆23Jul 7, 2025Updated last year