Tracking the latest and greatest research papers on video generation.
☆168Mar 28, 2026Updated 2 months ago
Alternatives and similar repositories for Video-Generation-Paper-List
Users that are interested in Video-Generation-Paper-List are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] This is the official PyTorch implementation of "BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Gen…☆45Oct 9, 2025Updated 8 months ago
- ☆47Jan 19, 2026Updated 5 months ago
- ☆152Feb 28, 2026Updated 3 months ago
- [ICCV 2025 / TCSVT 2026] FonTS: Text Rendering with Typography and Style Controls / WordCon: Word-level Typography Control in Visual Text…☆44May 26, 2026Updated 3 weeks ago
- ☆44Mar 31, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation☆20Jun 2, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- [CVPR2026] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning☆54Mar 27, 2026Updated 2 months ago
- Tracking the latest and greatest research papers on diffusion large language models.☆33Mar 13, 2026Updated 3 months ago
- ☆15Oct 24, 2024Updated last year
- [INTERSPEECH 2026]Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆112Jun 3, 2026Updated 2 weeks ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆121May 19, 2025Updated last year
- [ICCV 2025] Official PyTorch Implementation of "Learning Self-supervised Part-aware 3D Hybrid Representations of 2D Gaussians and Superqu…☆70Dec 22, 2025Updated 5 months ago
- [NeurIPS24] Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos☆23May 30, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption☆38May 21, 2025Updated last year
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 11 months ago
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆204Jun 8, 2025Updated last year
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆84Jul 6, 2025Updated 11 months ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Oct 19, 2022Updated 3 years ago
- [CVPR'26] UniGame code implementation☆20Apr 21, 2026Updated last month
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated 2 years ago
- ME-GraphAU on Video☆11May 10, 2024Updated 2 years ago
- [NIPS 2025] Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control☆48Apr 1, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://g…☆32May 15, 2024Updated 2 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆39Jan 27, 2025Updated last year
- ☆19Apr 2, 2026Updated 2 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer☆849Apr 27, 2025Updated last year
- Unified Codebase for Advanced World Models.☆823Jun 11, 2026Updated last week
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆50Apr 15, 2026Updated 2 months ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆21Jul 13, 2025Updated 11 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆23Jun 9, 2026Updated last week
- A modern implementation of MADDPG and MADDPG-Approx algorithms using PyTorch and PettingZoo environments. This project provides a clean, …☆20Mar 16, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [ICLR 2026] Official code for [EdiVal-Agent Automated, object-centric evaluation for multi-turn instruction-based image editing]☆28Mar 1, 2026Updated 3 months ago
- [NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation☆222May 19, 2026Updated last month
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- The official UniVerse-1 code.☆129Oct 13, 2025Updated 8 months ago
- PISCO: Precise Video Instance Insertion with Sparse Control☆62Feb 13, 2026Updated 4 months ago
- ☆44Jan 13, 2025Updated last year