FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.
☆52Jul 8, 2024Updated last year
Alternatives and similar repositories for FORA
Users that are interested in FORA are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆117Jul 15, 2024Updated last year
- ☆190Jan 14, 2025Updated last year
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆74Feb 9, 2026Updated 2 weeks ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Mar 4, 2025Updated 11 months ago
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆210Mar 14, 2025Updated 11 months ago
- ☆49Mar 3, 2024Updated last year
- 📚 Collection of awesome generation acceleration resources.☆388Jul 7, 2025Updated 7 months ago
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆60Sep 17, 2025Updated 5 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆259Dec 27, 2024Updated last year
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆373Feb 16, 2026Updated last week
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆167Nov 5, 2024Updated last year
- Data distillation benchmark☆72Jun 13, 2025Updated 8 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- (ToCa-v2) A New version of ToCa,with faster speed and better acceleration!☆40Mar 13, 2025Updated 11 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆47Jul 17, 2025Updated 7 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆212Sep 27, 2025Updated 5 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆73Oct 21, 2025Updated 4 months ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆80May 22, 2025Updated 9 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- ☆12Jan 4, 2024Updated 2 years ago
- Kernel Library Wheel for SGLang☆17Feb 22, 2026Updated last week
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆12Oct 3, 2024Updated last year
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆34Oct 24, 2025Updated 4 months ago
- ☆92Mar 26, 2025Updated 11 months ago
- Example of applying CUDA graphs to LLaMA-v2☆12Aug 25, 2023Updated 2 years ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆957Jun 27, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- ☆13Jan 11, 2026Updated last month
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆109Nov 27, 2024Updated last year
- 📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉☆525Updated this week
- VideoSys: An easy and efficient system for video generation☆2,016Aug 27, 2025Updated 6 months ago