Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48 · Jul 17, 2025 · Updated 11 months ago
Alternatives and similar repositories for SmoothCache
Users who are interested in SmoothCache are comparing it to the libraries listed below.
- ☆192 · Jan 14, 2025 · Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆14 · Mar 17, 2025 · Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025) ☆52 · Jan 14, 2026 · Updated 5 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆215 · Sep 27, 2025 · Updated 9 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models ☆67 · Feb 22, 2026 · Updated 4 months ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation ☆79 · Jun 11, 2025 · Updated last year
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc… ☆32 · Apr 14, 2026 · Updated 2 months ago
- FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling. ☆56 · Jul 8, 2024 · Updated last year
- Code for Draft Attention ☆103 · May 22, 2025 · Updated last year
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆152 · Mar 31, 2026 · Updated 2 months ago
- ☆35 · Jan 21, 2025 · Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free! ☆418 · Feb 26, 2025 · Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆13 · Sep 22, 2025 · Updated 9 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆219 · Sep 27, 2025 · Updated 9 months ago
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers" ☆172 · Nov 5, 2024 · Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention. ☆34 · Nov 29, 2024 · Updated last year
- Transforming Video Diffusion with Temporal Sparse Attention ☆54 · Apr 8, 2026 · Updated 2 months ago
- Model Compression Toolbox for Large Language Models and Diffusion Models ☆790 · Aug 14, 2025 · Updated 10 months ago
- (ACL2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆36 · May 28, 2025 · Updated last year
- ☆53 · Dec 20, 2024 · Updated last year
- FQGAN: Factorized Visual Tokenization and Generation ☆59 · Mar 29, 2025 · Updated last year
- ☆38 · Feb 6, 2025 · Updated last year
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025 ☆18 · Nov 24, 2024 · Updated last year
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality… ☆51 · Mar 25, 2025 · Updated last year
- Distilling Diversity and Control in Diffusion Models ☆52 · Apr 28, 2025 · Updated last year
- ☆15 · Mar 20, 2025 · Updated last year
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆149 · Oct 9, 2025 · Updated 8 months ago
- ☆24 · Dec 23, 2024 · Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 · Mar 13, 2025 · Updated last year
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005 ☆49 · Nov 8, 2024 · Updated last year
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow ☆167 · Dec 1, 2025 · Updated 6 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image" ☆317 · Dec 23, 2024 · Updated last year
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability". ☆69 · Dec 25, 2025 · Updated 6 months ago
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content" ☆73 · Jun 9, 2025 · Updated last year
- Transferring Genshin PVs into a freehand style with Diffusion Model. ☆10 · Jun 5, 2024 · Updated 2 years ago
- DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025) ☆13 · Feb 7, 2026 · Updated 4 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025) ☆47 · Nov 24, 2025 · Updated 7 months ago