Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48Jul 17, 2025Updated 8 months ago
Alternatives and similar repositories for SmoothCache
Users that are interested in SmoothCache are comparing it to the libraries listed below
Sorting:
- ☆191Jan 14, 2025Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated 2 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆212Sep 27, 2025Updated 5 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆59Feb 22, 2026Updated 3 weeks ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆78Jun 11, 2025Updated 9 months ago
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆30Nov 22, 2025Updated 3 months ago
- Code for Draft Attention☆100May 22, 2025Updated 9 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆52Jul 8, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 9 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.☆150Nov 3, 2025Updated 4 months ago
- ☆35Jan 21, 2025Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆416Feb 26, 2025Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆41Jun 9, 2025Updated 9 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 5 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆214Sep 27, 2025Updated 5 months ago
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆170Nov 5, 2024Updated last year
- Transforming Video Diffusion with Temporal Sparse Attention☆46Updated this week
- Model Compression Toolbox for Large Language Models and Diffusion Models☆762Aug 14, 2025Updated 7 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34May 28, 2025Updated 9 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated 11 months ago
- ☆52Dec 20, 2024Updated last year
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆16Nov 24, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆53Mar 25, 2025Updated 11 months ago
- Distilling Diversity and Control in Diffusion Models☆51Apr 28, 2025Updated 10 months ago
- ☆15Mar 20, 2025Updated last year
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆149Oct 9, 2025Updated 5 months ago
- ☆24Dec 23, 2024Updated last year
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005☆45Nov 8, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- [CVPR 2026] Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers☆55Feb 22, 2026Updated 3 weeks ago
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆165Dec 1, 2025Updated 3 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆314Dec 23, 2024Updated last year
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆51Dec 25, 2025Updated 2 months ago
- Transferring Genshin PVs into a freehand style with Diffusion Model.☆10Jun 5, 2024Updated last year
- ☆19Jan 8, 2025Updated last year
- [ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆38Nov 27, 2024Updated last year