Shenyi-Z/Cache4Diffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shenyi-Z/Cache4Diffusion)

Shenyi-Z / Cache4Diffusion

Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.

☆109

Alternatives and similar repositories for Cache4Diffusion

Users that are interested in Cache4Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tammytcl / Awesome-Diffusion-Acceleration-Cache
View on GitHub
A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration tech…
☆86Nov 4, 2025Updated 8 months ago
Shenyi-Z / TaylorSeer
View on GitHub
[ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
☆406Mar 2, 2026Updated 4 months ago
bytedance / ERTACache
View on GitHub
☆24Sep 4, 2025Updated 10 months ago
Bujiazi / DiCache
View on GitHub
[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆61Jan 26, 2026Updated 5 months ago
jiwoogit / SeaCache
View on GitHub
[CVPR 2026 Oral, Best Paper Finalist] SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models
☆89Jun 29, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
argsss / DPCache
View on GitHub
[CVPR 2026] Denoising as Path Planning: Training-Free Acceleration of Diffusion Models with DPCache
☆41Jul 1, 2026Updated 2 weeks ago
fenglang918 / HiCache
View on GitHub
HiCache: Hermite Polynomial-based Feature Cache for diffusion inference
☆15Jan 27, 2026Updated 5 months ago
Shenyi-Z / ToCa
View on GitHub
[ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching
☆221Mar 14, 2025Updated last year
vita-epfl / SenCache
View on GitHub
[CVPR 2026 Oral] SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
☆21Jun 5, 2026Updated last month
Tencent-Hunyuan / DisCa
View on GitHub
DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
☆24Apr 15, 2026Updated 3 months ago
Bluear7878 / H2-Cache-A-Hierarchical-Dual-Stage-Cache
View on GitHub
☆22Nov 3, 2025Updated 8 months ago
vipshop / cache-dit
View on GitHub
A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.
☆1,234Updated this week
xuyang-liu16 / Awesome-Generation-Acceleration
View on GitHub
📚 Collection of awesome generation acceleration resources.
☆401Jul 7, 2025Updated last year
NoakLiu / FastCache-xDiT
View on GitHub
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]
☆52Apr 29, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mikeallen39 / FlowCache
View on GitHub
[ICLR2026] The open-source code for FlowCache, including accelerated implementations of the MAGI-1 and Skyreels-V2.
☆29Apr 24, 2026Updated 2 months ago
xlite-dev / Awesome-DiT-Inference
View on GitHub
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
☆578Jun 13, 2026Updated last month
svg-project / Quant-VideoGen
View on GitHub
[ICML2026] Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
☆60Jun 4, 2026Updated last month
UnicomAI / LeMiCa
View on GitHub
[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
☆122Jun 22, 2026Updated 3 weeks ago
sandyresearch / chipmunk
View on GitHub
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …
☆111Sep 8, 2025Updated 10 months ago
svg-project / Sparse-VideoGen
View on GitHub
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆692Jul 4, 2026Updated 2 weeks ago
feice-huang / ConvRot
View on GitHub
Official ConvRot implementation. A plug-and-play, convolution-like rotation module enabling efficient W4A4 quantization for diffusion mod…
☆19Jul 3, 2026Updated 2 weeks ago
EvelynZhang-epiclab / SiTo
View on GitHub
[AAAI-2025] The offical code for SiTo （Similarity-based Token Pruning for Stable Diffusion Models）
☆46Jun 2, 2025Updated last year
JIA-Lab-research / Jenga
View on GitHub
[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving
☆287Aug 4, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Juanerx / Q-DiT
View on GitHub
[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
☆79Sep 3, 2024Updated last year
yuriYanZeXuan / EEdit
View on GitHub
(ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
☆62Sep 17, 2025Updated 10 months ago
thu-nics / DiTFastAttn
View on GitHub
☆192Jan 14, 2025Updated last year
maomaocun / dLLM-cache
View on GitHub
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211May 1, 2026Updated 2 months ago
HelloZicky / NaviCache
View on GitHub
[ICML 2026] Official implementation of "NaviCache: Test-Time Self-Calibration Caching for Video Generation".
☆27Jul 9, 2026Updated last week
joelulu / Awesome-Acceleration-GenAI
View on GitHub
Collection of Acceleration Methods for Generative AI
☆29Dec 9, 2025Updated 7 months ago
prathebaselva / FORA
View on GitHub
FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.
☆56Jul 8, 2024Updated 2 years ago
ZichenWen1 / EPIC
View on GitHub
(NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"
☆49Feb 11, 2026Updated 5 months ago
thu-nics / ViDiT-Q
View on GitHub
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆163Mar 21, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ali-vilab / TeaCache
View on GitHub
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
☆1,356Jun 8, 2025Updated last year
thu-ml / SpargeAttn
View on GitHub
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆1,017Feb 25, 2026Updated 4 months ago
hanjq17 / CHORDS
View on GitHub
[ICCV 2025] CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers
☆17Mar 3, 2026Updated 4 months ago
InnovatorLM / Innovator-VL
View on GitHub
Fully Open-source Multimodal Language Models for Science Discovery
☆167Mar 20, 2026Updated 4 months ago
ModelTC / QVGen
View on GitHub
[ICLR 2026] This is the official PyTorch implementation of "QVGen: Pushing the Limit of Quantized Video Generative Models".
☆32Feb 11, 2026Updated 5 months ago
mayuelala / EasyVFX
View on GitHub
[SIGGRAPH 2026] EasyVFX: This repo is the official implementation of "EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Gen…
☆21May 24, 2026Updated last month
ChenyuWang-Monica / REED
View on GitHub
Code for paper: "Learning Diffusion Models with Flexible Representation Guidance"
☆16Mar 18, 2026Updated 4 months ago