Peyton-Chen / Sparse-vDiTLinks
The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv 2025)
☆42Updated last month
Alternatives and similar repositories for Sparse-vDiT
Users that are interested in Sparse-vDiT are comparing it to the libraries listed below
Sorting:
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆34Updated 2 weeks ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆51Updated 3 months ago
- Lumos Project: Frontier generative model research by Alibaba DAMO Academy, including Lumos-1, etc.☆80Updated this week
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆63Updated last month
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆105Updated 3 weeks ago
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆39Updated 2 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆152Updated 8 months ago
- ☆19Updated 3 months ago
- Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆106Updated last week
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆120Updated 2 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆69Updated 3 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆67Updated 2 months ago
- Code for Draft Attention☆87Updated last month
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆47Updated last year
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆220Updated 2 weeks ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆112Updated 2 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 8 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆50Updated 3 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 8 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆76Updated 2 months ago
- ☆50Updated 7 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated 9 months ago
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆50Updated 3 months ago
- ☆46Updated 4 months ago
- SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024)☆65Updated last week
- ☆82Updated 3 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆44Updated 3 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆38Updated 2 weeks ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Updated last year
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆51Updated 4 months ago