Peyton-Chen / Sparse-vDiTLinks
The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv 2025)
☆43Updated 2 months ago
Alternatives and similar repositories for Sparse-vDiT
Users that are interested in Sparse-vDiT are comparing it to the libraries listed below
Sorting:
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆41Updated last month
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆53Updated 4 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆156Updated 9 months ago
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆42Updated 4 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆67Updated 3 months ago
- DC-Gen: Accelerating Diffusion Models with Compressed Latent Space☆53Updated 2 weeks ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆111Updated 2 months ago
- Lumos Project: Frontier generative model research by Alibaba DAMO Academy, including Lumos-1, etc.☆127Updated last month
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆122Updated 4 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆115Updated 3 months ago
- ☆19Updated 4 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆48Updated last year
- Official PyTorch/Diffusers implementation of "RectifiedHR: Enable Efficient High Resolution Image Generation via Energy Rectification"☆21Updated 5 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆71Updated 4 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆82Updated 3 months ago
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆134Updated 4 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆37Updated 2 months ago
- ☆88Updated last week
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 9 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆42Updated last month
- TPDiff: Temporal Pyramid Video Diffusion Model☆20Updated 5 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆71Updated 10 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆52Updated 5 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆52Updated 5 months ago
- ☆61Updated 5 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆232Updated 3 weeks ago
- Code for Draft Attention☆90Updated 3 months ago
- Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆151Updated last week
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆65Updated 3 months ago