QuenithAI / Video-Generation-Paper-ListLinks
Tracking the latest and greatest research papers on video generation.
☆75Updated this week
Alternatives and similar repositories for Video-Generation-Paper-List
Users that are interested in Video-Generation-Paper-List are comparing it to the libraries listed below
Sorting:
- The official UniVerse-1 code.☆81Updated this week
- DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆128Updated this week
- ☆71Updated 6 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆129Updated 5 months ago
- 4-steps distilled version of Wan2.2-TI2V-5B☆80Updated 2 weeks ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆76Updated last week
- Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"☆138Updated last week
- StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation☆39Updated 3 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆186Updated 3 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆56Updated 2 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆92Updated 4 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆170Updated 4 months ago
- ☆129Updated 3 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆226Updated last month
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆36Updated 8 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆242Updated last month
- [ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance☆162Updated last month
- FACM: Flow-Anchored Consistency Models☆119Updated last month
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆35Updated 4 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆139Updated 2 months ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆91Updated 3 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆301Updated 5 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆91Updated last month
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆83Updated 4 months ago
- ☆127Updated 3 months ago
- ☆119Updated last month
- Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unifie…☆240Updated this week
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆65Updated last week
- Implementation Code for Omni-Effects☆148Updated 2 weeks ago
- ☆151Updated last week