Tracking the latest and greatest research papers on video generation.
☆162Mar 28, 2026Updated last month
Alternatives and similar repositories for Video-Generation-Paper-List
Users that are interested in Video-Generation-Paper-List are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] This is the official PyTorch implementation of "BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Gen…☆42Oct 9, 2025Updated 7 months ago
- [ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey…☆732Apr 13, 2026Updated 3 weeks ago
- ☆46Jan 19, 2026Updated 3 months ago
- ☆147Feb 28, 2026Updated 2 months ago
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆42Apr 24, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR2026] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning☆45Mar 27, 2026Updated last month
- VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation☆19Jun 2, 2025Updated 11 months ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Tracking the latest and greatest research papers on diffusion large language models.☆33Mar 13, 2026Updated last month
- ☆15Oct 24, 2024Updated last year
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆112Dec 20, 2025Updated 4 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆120May 19, 2025Updated 11 months ago
- ☆13Feb 26, 2025Updated last year
- [ICCV 2025] Official PyTorch Implementation of "Learning Self-supervised Part-aware 3D Hybrid Representations of 2D Gaussians and Superqu…☆66Dec 22, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption☆38May 21, 2025Updated 11 months ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 10 months ago
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆204Jun 8, 2025Updated 11 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆84Jul 6, 2025Updated 10 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆40Jan 6, 2024Updated 2 years ago
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆159Apr 6, 2026Updated last month
- PISCO: Precise Video Instance Insertion with Sparse Control☆59Feb 13, 2026Updated 2 months ago
- [CVPR'26] UniGame code implementation☆19Apr 21, 2026Updated 2 weeks ago
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 11 months ago
- ME-GraphAU on Video☆11May 10, 2024Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- Pretrained Diffusion Models for Unified Human Motion Synthesis☆18Feb 28, 2023Updated 3 years ago
- [ICLR 2026] Official code for [EdiVal-Agent Automated, object-centric evaluation for multi-turn instruction-based image editing]☆27Mar 1, 2026Updated 2 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer☆846Apr 27, 2025Updated last year
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆46Apr 15, 2026Updated 3 weeks ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official UniVerse-1 code.☆126Oct 13, 2025Updated 6 months ago
- This repository will collect and share awesome ChatGPT related papers and useful tools☆17Apr 2, 2023Updated 3 years ago
- ☆294Feb 3, 2026Updated 3 months ago
- Developer project for getting basic API integrations working in under 5 minutes☆11Updated this week
- ☆13Jul 5, 2024Updated last year
- Official code for the CVPR'23 paper "Continuous Intermediate Token Learning with Implicit Motion Manifold for Keyframe based Motion Inter…☆28Sep 19, 2023Updated 2 years ago
- Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"☆48Sep 3, 2025Updated 8 months ago