Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48Jul 17, 2025Updated 10 months ago
Alternatives and similar repositories for SmoothCache
Users that are interested in SmoothCache are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆192Jan 14, 2025Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated 4 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆214Sep 27, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆65Feb 22, 2026Updated 2 months ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆79Jun 11, 2025Updated 11 months ago
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…☆31Apr 14, 2026Updated last month
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆56Jul 8, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 11 months ago
- Code for Draft Attention☆103May 22, 2025Updated 11 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.☆152Mar 31, 2026Updated last month
- ☆35Jan 21, 2025Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆418Feb 26, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 7 months ago
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆171Nov 5, 2024Updated last year
- Transforming Video Diffusion with Temporal Sparse Attention☆48Apr 8, 2026Updated last month
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆44Jun 9, 2025Updated 11 months ago
- Model Compression Toolbox for Large Language Models and Diffusion Models☆783Aug 14, 2025Updated 9 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated last year
- ☆38Feb 6, 2025Updated last year
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆51Mar 25, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Distilling Diversity and Control in Diffusion Models☆52Apr 28, 2025Updated last year
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆150Oct 9, 2025Updated 7 months ago
- ☆24Dec 23, 2024Updated last year
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005☆47Nov 8, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆166Dec 1, 2025Updated 5 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆315Dec 23, 2024Updated last year
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆55Dec 25, 2025Updated 4 months ago
- [CVPR 2026 Highlight] Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers☆63Apr 24, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Transferring Genshin PVs into a freehand style with Diffusion Model.☆10Jun 5, 2024Updated last year
- This repository contains the code and released models for the paper Segmenting Text and Learning Their Rewards for Improved RLHF in Langu…☆19Jan 8, 2025Updated last year
- DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)☆13Feb 7, 2026Updated 3 months ago
- [ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆40Nov 27, 2024Updated last year
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆47Nov 24, 2025Updated 5 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆185Mar 20, 2025Updated last year
- ☆16Apr 30, 2024Updated 2 years ago