sayakpaul / diffusers-torchaoLinks
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
☆366Updated last month
Alternatives and similar repositories for diffusers-torchao
Users that are interested in diffusers-torchao are comparing it to the libraries listed below
Sorting:
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆269Updated 9 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆329Updated last week
- Faster generation with text-to-image diffusion models.☆215Updated 2 weeks ago
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)☆602Updated 4 months ago
- ☆283Updated 6 months ago
- faster parallel inference of mochi-1 video generation model☆123Updated 4 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆403Updated 4 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆234Updated 6 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆202Updated 4 months ago
- Model Compression Toolbox for Large Language Models and Diffusion Models☆526Updated 3 months ago
- ☆430Updated last year
- Radial Attention Official Implementation☆303Updated last week
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆483Updated 7 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆694Updated 7 months ago
- Scalable and memory-optimized training of diffusion models☆1,207Updated last month
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆186Updated 3 months ago
- Making Flux go brrr on GPUs.☆95Updated last week
- Generate long weighted prompt embeddings for Stable Diffusion☆128Updated 2 months ago
- Enhance-A-Video: Better Generated Video for Free☆548Updated 3 months ago
- ☆125Updated 4 months ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆388Updated 3 months ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆546Updated 9 months ago
- ☆127Updated last week
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,275Updated 3 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆424Updated last year
- The best OSS video generation models☆134Updated 8 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆207Updated 3 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆963Updated last month
- ☆541Updated 7 months ago
- A detailed diagram laying out the full Flux.1 [dev] architecture as shared by Black Forest Labs at https://github.com/black-forest-labs/f…☆67Updated 8 months ago