BienLuky / CacheQuantLinks

[CVPR 2025] The official implementation of "CacheQuant: Comprehensively Accelerated Diffusion Models"

☆28

Alternatives and similar repositories for CacheQuant

Users that are interested in CacheQuant are comparing it to the libraries listed below

Sorting:

ModelTC / TFMQ-DM
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…
☆109Updated last month
mit-han-lab / lpd
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆78Updated 4 months ago
Shenyi-Z / ToCa
[ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching
☆195Updated 8 months ago
ThisisBillhe / ZipAR
[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…
☆53Updated 7 months ago
thu-nics / DiTFastAttn
☆186Updated 10 months ago
czg1225 / CoDe
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆107Updated last month
Shenyi-Z / DuCa
(ToCa-v2) A New version of ToCa，with faster speed and better acceleration!
☆38Updated 8 months ago
thu-nics / ViDiT-Q
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆132Updated 7 months ago
A-suozhang / Awesome-Efficient-Diffusion
Curated list of methods that focuses on improving the efficiency of diffusion models
☆44Updated last year
Juanerx / Q-DiT
[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
☆72Updated last year
NUS-HPC-AI-Lab / Dynamic-Diffusion-Transformer
☆89Updated 7 months ago
VainF / TinyFusion
[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow
☆145Updated 7 months ago
horseee / learning-to-cache
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
☆116Updated last year
csguoh / FastVAR
[ICCV2025]Generate one 2K image on single 3090 GPU!
☆78Updated 2 months ago
ChangyuanWang17 / APQ-DM
This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…
☆38Updated last year
xuyang-liu16 / Awesome-Generation-Acceleration
📚 Collection of awesome generation acceleration resources.
☆363Updated 4 months ago
ThisisBillhe / EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…
☆66Updated last year
BienLuky / EDA-DM
The official implementation of "EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models"
☆15Updated 4 months ago
hatchetProject / QuEST
[ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models
☆55Updated 4 months ago
AdaCache-DiT / AdaCache
Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"
☆160Updated last year
thu-nics / FrameFusion
[ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"
☆66Updated 2 weeks ago
hp-l33 / ARPG
Autoregressive Image Generation with Randomized Parallel Decoding
☆81Updated 3 weeks ago
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆109Updated 4 months ago
NUS-HPC-AI-Lab / HASTE
☆36Updated 5 months ago
qhfan / MALA
[ICCV2025 highlight]Rectifying Magnitude Neglect in Linear Attention
☆48Updated 3 months ago
ChangyuanWang17 / QVLM
[NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.
☆91Updated 10 months ago
JunyiWuCode / QuantCache
[ICCV 2025] QuantCache：Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
☆14Updated last month
prathebaselva / FORA
FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.
☆51Updated last year
NUS-HPC-AI-Lab / Dynamic-Tuning
The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"
☆50Updated 10 months ago
42Shawn / PTQ4DM
Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)
☆139Updated 2 years ago