MingXiangL / Teacache-xDiT
Combining Teacache with xDiT to Accelerate Visual Generation Models
☆15Updated this week
Alternatives and similar repositories for Teacache-xDiT:
Users that are interested in Teacache-xDiT are comparing it to the libraries listed below
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆255Updated 3 weeks ago
- Accelerating Diffusion Transformers with Token-wise Feature Caching☆132Updated last month
- A parallelism VAE avoids OOM for high resolution image generation☆61Updated 3 months ago
- ☆160Updated 3 months ago
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation☆77Updated last month
- Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity☆178Updated last week
- Model Compression Toolbox for Large Language Models and Diffusion Models☆435Updated 3 weeks ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆35Updated 5 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆210Updated 4 months ago
- [CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Mo…☆62Updated 8 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆197Updated 2 months ago
- 📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉☆212Updated last month
- 📚 Collection of awesome generation acceleration resources.☆215Updated this week
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆145Updated 5 months ago
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆125Updated 2 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆677Updated 4 months ago
- ☆114Updated this week
- SpargeAttention: A training-free sparse attention that can accelerate any model inference.☆488Updated this week
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models☆99Updated last year
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆101Updated 9 months ago
- X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation☆60Updated 3 weeks ago
- [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.☆348Updated last year
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆204Updated 2 weeks ago
- QuEST: Efficient Finetuning for Low-bit Diffusion Models☆43Updated 3 months ago
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆47Updated 7 months ago
- From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆72Updated 3 weeks ago
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆136Updated 2 years ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆342Updated 2 months ago
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆13Updated 9 months ago
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training☆248Updated this week