QuenithAI / Video-Generation-Paper-ListLinks
Tracking the latest and greatest research papers on video generation.
☆55Updated last week
Alternatives and similar repositories for Video-Generation-Paper-List
Users that are interested in Video-Generation-Paper-List are comparing it to the libraries listed below
Sorting:
- ☆64Updated 5 months ago
- FACM: Flow-Anchored Consistency Models☆115Updated 3 weeks ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆35Updated 8 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆166Updated 3 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆82Updated 9 months ago
- PyTorch re-implementation for MeanFlow☆95Updated last month
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆134Updated last month
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆178Updated 2 months ago
- Official repository for Muti-human Interactive Talking Dataset☆44Updated 3 weeks ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆123Updated 4 months ago
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆34Updated 4 months ago
- Towards training VQ-VAE models robustly!☆83Updated last month
- ☆122Updated 2 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆80Updated 4 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆85Updated 3 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆50Updated 8 months ago
- This is the official implementation for DragVideo☆52Updated 11 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆199Updated 5 months ago
- Blending Custom Photos with Video Diffusion Transformers☆47Updated 7 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆37Updated 2 months ago
- OpenVideo specializes in the domain of text-to-video generation, with the goal of providing high-quality and diverse video datasets to AI…☆108Updated 3 months ago
- ☆65Updated last month
- ☆128Updated 2 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆53Updated 4 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆294Updated 5 months ago
- Code for D-DiT☆45Updated 5 months ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆58Updated 2 months ago
- Awesome Controllable Video Generation with Diffusion Models☆55Updated last month
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆110Updated 4 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆279Updated 9 months ago