CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆90May 12, 2025Updated 10 months ago
Alternatives and similar repositories for pa_vdm
Users that are interested in pa_vdm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 18, 2024Updated last year
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆170Nov 5, 2024Updated last year
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆322Mar 30, 2025Updated 11 months ago
- [CVPR2025] Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think☆23Jul 1, 2025Updated 8 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Oct 8, 2024Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆312Mar 12, 2025Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆767Dec 5, 2024Updated last year
- ☆11Sep 28, 2024Updated last year
- ☆52Dec 13, 2024Updated last year
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆128Jan 9, 2025Updated last year
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆307Mar 7, 2026Updated 2 weeks ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆647Jul 1, 2025Updated 8 months ago
- Scalable and memory-optimized training of diffusion models☆1,343Jun 4, 2025Updated 9 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆646Mar 6, 2026Updated 2 weeks ago
- ☆646May 24, 2024Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆120May 14, 2025Updated 10 months ago
- CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control☆171Dec 2, 2024Updated last year
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Sep 8, 2023Updated 2 years ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆200May 11, 2025Updated 10 months ago
- ☆93Mar 26, 2025Updated 11 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆51Nov 10, 2025Updated 4 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆636Oct 29, 2025Updated 4 months ago
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,269Aug 7, 2025Updated 7 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆322Aug 10, 2024Updated last year
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆514Jun 17, 2025Updated 9 months ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆398Jan 19, 2025Updated last year
- ☆23Feb 21, 2025Updated last year
- Pusa: Thousands Timesteps Video Diffusion Model☆674Feb 13, 2026Updated last month
- A reading list of video generation☆678Updated this week
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆482Oct 18, 2024Updated last year
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆124Jan 25, 2025Updated last year
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…☆253May 5, 2024Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆129Nov 29, 2024Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 6 months ago
- Code for full fintuing Mochi model with FSDP (and CP)☆30Jul 15, 2025Updated 8 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆511Sep 2, 2024Updated last year
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆29Aug 26, 2025Updated 6 months ago