horseee / learning-to-cache

[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

☆107

Alternatives and similar repositories for learning-to-cache

Users who are interested in learning-to-cache are comparing it to the libraries listed below.

prathebaselva / FORA
FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling.
☆47 Updated last year
AdaCache-DiT / AdaCache
Adaptive Caching for Faster Video Generation with Diffusion Transformers
☆152 Updated 8 months ago
ThisisBillhe / ZipAR
[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…
☆50 Updated 3 months ago
thu-nics / DiTFastAttn
☆170 Updated 6 months ago
NUS-HPC-AI-Lab / SpeeD
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
☆182 Updated 5 months ago
ThisisBillhe / EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…
☆60 Updated last year
mit-han-lab / lpd
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆55 Updated last week
Juanerx / Q-DiT
[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
☆53 Updated 10 months ago
czg1225 / CoDe
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆102 Updated 3 months ago
NUS-HPC-AI-Lab / Dynamic-Diffusion-Transformer
☆82 Updated 3 months ago
VainF / TinyFusion
[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow
☆130 Updated 3 months ago
horseee / dKV-Cache
☆88 Updated last month
ModelTC / TFMQ-DM
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…
☆102 Updated this week
hp-l33 / ARPG
Autoregressive Image Generation with Randomized Parallel Decoding
☆68 Updated 3 months ago
shawnricecake / draft-attention
Code for Draft Attention
☆85 Updated last month
OpenSparseLLMs / Skip-DiT
✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
☆71 Updated 3 months ago
byeongjun-park / Switch-DiT
[ECCV 2024] Official PyTorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"
☆43 Updated last year
microsoft / RAS
An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…
☆133 Updated 3 weeks ago
A-suozhang / Awesome-Efficient-Diffusion
A curated list of methods that focus on improving the efficiency of diffusion models
☆45 Updated last year
NVlabs / T-Stitch
[ICLR 2025] Official PyTorch implementation of the paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…
☆103 Updated last year
ziplab / PTQD
The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models
☆99 Updated last year
svg-project / Sparse-VideoGen
[ICML 2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
☆364 Updated last month
Lucky-Lance / TerDiT
TerDiT: Ternary Diffusion Models with Transformers
☆71 Updated last year
ThisisBillhe / NAR
The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"
☆51 Updated 3 months ago
Shenyi-Z / ToCa
Accelerating Diffusion Transformers with Token-wise Feature Caching
☆162 Updated 4 months ago
maple-research-lab / SIM
Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]
☆81 Updated 7 months ago
LINs-lab / GMem
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆37 Updated 4 months ago
tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆112 Updated 2 months ago
VainF / Diff-Pruning
[NeurIPS 2023] Structural Pruning for Diffusion Models
☆198 Updated last year
MonoFormer / MonoFormer
The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"
☆86 Updated 9 months ago