AlonzoLeeeooo/awesome-video-generation

A collection of awesome video generation studies.

☆778

Alternatives and similar repositories for awesome-video-generation

Users who are interested in awesome-video-generation are comparing it to the repositories listed below.

AlonzoLeeeooo / awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
☆761 • Apr 25, 2026 • Updated 3 months ago
showlab / Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,737 • Updated this week
AlonzoLeeeooo / awesome-image-inpainting-studies
A collection of awesome image inpainting studies.
☆394 • Feb 4, 2026 • Updated 5 months ago
ChenHsing / Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
☆2,307 • Jun 22, 2026 • Updated last month
Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,714 • Mar 23, 2026 • Updated 4 months ago
yzhang2016 / video-generation-survey
A reading list of video generation
☆723 • Jul 22, 2026 • Updated last week
Vchitect / Latte
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,948 • Oct 30, 2025 • Updated 8 months ago
Vchitect / VideoBooth
[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts
☆309 • Jun 9, 2024 • Updated 2 years ago
PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
☆1,111 • Dec 31, 2024 • Updated last year
mayuelala / Awesome-Controllable-Video-Generation
[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey…
☆760 • Apr 13, 2026 • Updated 3 months ago
YingqingHe / Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
☆551 • Apr 4, 2025 • Updated last year
showlab / Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
☆830 • Oct 10, 2025 • Updated 9 months ago
soraw-ai / Awesome-Text-to-Video-Generation
A list for Text-to-Video, Image-to-Video works
☆255 • Mar 11, 2026 • Updated 4 months ago
Kobaayyy / Awesome-CVPR2026-CVPR2025-ICCV2025-CVPR2024-ECCV2026-ECCV2024-AIGC
A Collection of Papers and Codes for CVPR2026/CVPR2025/ICCV2025/CVPR2024/ECCV2026/ECCV2024 AIGC
☆674 • Jul 22, 2026 • Updated last week
wangkai930418 / awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
☆2,220 • Mar 16, 2026 • Updated 4 months ago
wenhao728 / awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translati…
☆291 • Apr 8, 2026 • Updated 3 months ago
snap-research / Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
☆700 • Oct 25, 2024 • Updated last year
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆2,025 • Aug 27, 2025 • Updated 11 months ago
aigc-apps / VideoX-Fun
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
☆2,181 • Updated this week
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,020 • Nov 25, 2025 • Updated 8 months ago
yifan123 / flow_grpo
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,440 • May 7, 2026 • Updated 2 months ago
VideoVerses / VideoTuna
Let's finetune video generation models!
☆551 • Sep 15, 2025 • Updated 10 months ago
baaivision / NOVA
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆657 • Oct 29, 2025 • Updated 9 months ago
JunyaoHu / common_metrics_on_video_quality
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
☆582 • Jan 17, 2026 • Updated 6 months ago
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960 • Aug 15, 2024 • Updated last year
TIGER-AI-Lab / ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]
☆260 • Jul 1, 2024 • Updated 2 years ago
CIntellifusion / VideoDPO
Official Implementation of VideoDPO
☆169 • Jun 1, 2025 • Updated last year
ChaofanTao / Autoregressive-Models-in-Vision-Survey
[TMLR 2025🔥] A survey for the autoregressive models in vision.
☆805 • May 5, 2026 • Updated 2 months ago
hao-ai-lab / FastVideo
A unified inference and post-training framework for accelerated video generation.
☆3,894 • Updated this week
PKU-YuanGroup / ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
☆848 • Apr 14, 2026 • Updated 3 months ago
Ji4chenLi / t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
☆312 • Jan 31, 2025 • Updated last year
huggingface / finetrainers
Scalable and memory-optimized training of diffusion models
☆1,355 • May 26, 2026 • Updated 2 months ago
Doubiiu / DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
☆3,007 • Sep 8, 2024 • Updated last year
ali-vilab / VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
☆3,155 • Jan 10, 2025 • Updated last year
facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,693 • May 31, 2024 • Updated 2 years ago
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,248 • Feb 16, 2025 • Updated last year
zai-org / CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
☆12,921 • Nov 4, 2025 • Updated 8 months ago
lxa9867 / Awesome-Autoregressive-Visual-Generation
This is a repo to track the latest autoregressive visual generation papers.
☆430 • Jun 25, 2025 • Updated last year
showlab / Show-o
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,965 • Jan 8, 2026 • Updated 6 months ago