Tracking the latest and greatest research papers on video generation.
☆137Mar 19, 2026Updated this week
Alternatives and similar repositories for Video-Generation-Paper-List
Users that are interested in Video-Generation-Paper-List are comparing it to the libraries listed below
Sorting:
- This is the official PyTorch implementation of "BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation."☆40Oct 9, 2025Updated 5 months ago
- ☆130Feb 28, 2026Updated 2 weeks ago
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆39Nov 5, 2025Updated 4 months ago
- Tracking the latest and greatest research papers on diffusion large language models.☆23Mar 13, 2026Updated last week
- [ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey…☆698Nov 11, 2025Updated 4 months ago
- An instruction to 1) download the Kinetics-400/Kinetics-600, 2) resize the videos, and 3) prepare annotations.☆11Jan 19, 2022Updated 4 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆109Dec 20, 2025Updated 3 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"☆38May 21, 2025Updated 9 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆119May 19, 2025Updated 10 months ago
- ☆13Feb 26, 2025Updated last year
- PyQt6 GUI to queue and render images and videos using ComfyUI Workflows☆18Mar 5, 2025Updated last year
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 8 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆84Jul 6, 2025Updated 8 months ago
- official code for unigame☆19Nov 26, 2025Updated 3 months ago
- PISCO: Precise Video Instance Insertion with Sparse Control☆49Feb 13, 2026Updated last month
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 9 months ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- Pretrained Diffusion Models for Unified Human Motion Synthesis☆18Feb 28, 2023Updated 3 years ago
- The official repository for "From Hearing to Seeing: Linking Auditory and Visual Place Perceptions with Soundscape-to-Image Generative Ar…☆21Dec 13, 2025Updated 3 months ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆42Jan 9, 2026Updated 2 months ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 8 months ago
- [NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation☆200Mar 8, 2026Updated last week
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- ☆43Jan 13, 2025Updated last year
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆42Nov 20, 2025Updated 4 months ago
- ☆285Feb 3, 2026Updated last month
- This repository will collect and share awesome ChatGPT related papers and useful tools☆18Apr 2, 2023Updated 2 years ago
- ☆14Jul 5, 2024Updated last year
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated last month
- Unofficial extension implementation of CausVid☆75Apr 28, 2025Updated 10 months ago
- A collection of awesome video generation studies.☆752Dec 27, 2025Updated 2 months ago
- Awesome Controllable Video Generation with Diffusion Models☆60Jul 22, 2025Updated 7 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆48Oct 10, 2025Updated 5 months ago
- [ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation☆157Sep 4, 2025Updated 6 months ago
- Official inference code for SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory☆228Updated this week