CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆90 · May 12, 2025 · Updated 9 months ago
Alternatives and similar repositories for pa_vdm
Users who are interested in pa_vdm are comparing it to the repositories listed below
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers" ☆167 · Nov 5, 2024 · Updated last year
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆321 · Mar 30, 2025 · Updated 11 months ago
- [NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation" ☆138 · Oct 8, 2024 · Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆309 · Mar 12, 2025 · Updated 11 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation ☆306 · Jun 29, 2025 · Updated 8 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆118 · May 14, 2025 · Updated 9 months ago
- [NeurIPS 2024] Official Implementation of GrounDiT ☆59 · Dec 12, 2024 · Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆767 · Dec 5, 2024 · Updated last year
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model ☆198 · May 11, 2025 · Updated 9 months ago
- [CVPR 2025] Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think ☆23 · Jul 1, 2025 · Updated 8 months ago
- [ICML 2025, NeurIPS 2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention ☆629 · Feb 3, 2026 · Updated last month
- Benchmark dataset and code of MSRVTT-Personalization ☆52 · Nov 10, 2025 · Updated 3 months ago
- Video Diffusion Transformers are In-Context Learners ☆35 · Jan 6, 2025 · Updated last year
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation ☆53 · Jan 5, 2026 · Updated last month
- Code of the paper "FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi… ☆28 · Aug 26, 2025 · Updated 6 months ago
- Scalable and memory-optimized training of diffusion models ☆1,341 · Jun 4, 2025 · Updated 9 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆629 · Oct 29, 2025 · Updated 4 months ago
- The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation" ☆124 · Jan 25, 2025 · Updated last year
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion" ☆630 · Jul 1, 2025 · Updated 8 months ago
- Official PyTorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). ☆98 · Feb 11, 2025 · Updated last year
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper ☆125 · Jan 9, 2025 · Updated last year
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation ☆39 · May 4, 2025 · Updated 10 months ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE ☆392 · Jan 19, 2025 · Updated last year
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences ☆322 · Aug 10, 2024 · Updated last year
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models ☆295 · May 17, 2025 · Updated 9 months ago
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation ☆515 · Jun 17, 2025 · Updated 8 months ago
- Pusa: Thousands Timesteps Video Diffusion Model ☆671 · Feb 13, 2026 · Updated 2 weeks ago
- [CVPR 2024] "Taming Mode Collapse in Score Distillation for Text-to-3D Generation" by Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Srey… ☆51 · Feb 2, 2024 · Updated 2 years ago
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆314 · Jan 31, 2025 · Updated last year
- Keyframe Interpolation with CogVideoX ☆139 · Oct 31, 2024 · Updated last year
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆66 · May 7, 2025 · Updated 9 months ago
- CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control ☆172 · Dec 2, 2024 · Updated last year