[TMLR] Video Generation Models: A Survey of Post-Training and Alignment | π₯ A continuously updated collection of papers, datasets, and benchmarks on post-training and alignment for video generation.
β159Jun 10, 2026Updated last week
Alternatives and similar repositories for Awesome-Video-Generation-Post-Training
Users that are interested in Awesome-Video-Generation-Post-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Are Video Models Ready as Zero-shot Reasoners?β87Nov 24, 2025Updated 6 months ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detectionβ17Mar 23, 2025Updated last year
- [ICLR 2025] FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noiseβ14Mar 5, 2025Updated last year
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).β13Mar 25, 2022Updated 4 years ago
- [NeurIPS 2024] Official repository for downloading and using LAVIBβ24Aug 6, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official repo for the DanQing dataset.β36Mar 25, 2026Updated 2 months ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulationβ131Apr 28, 2026Updated last month
- Official implementation of "Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals" (CVPR 2026)β39Feb 25, 2026Updated 3 months ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023β12Sep 21, 2023Updated 2 years ago
- β10Apr 24, 2024Updated 2 years ago
- An innovative method designed to augment the capabilities of existing video diffusion modelsβ22May 10, 2024Updated 2 years ago
- β22Nov 16, 2025Updated 7 months ago
- A curated list of zero-shot captioning papersβ24Aug 26, 2023Updated 2 years ago
- Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework fβ¦β28Nov 4, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"β26Feb 4, 2026Updated 4 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.β39Feb 13, 2025Updated last year
- Offical PyTorch implementation of LHNet (ACM MM 2023)β14Feb 23, 2024Updated 2 years ago
- Full model implementation for Flow Equivariant World Models (ICML 2026), world models with memory for dynamic scenesβ45May 21, 2026Updated 3 weeks ago
- β38Jun 2, 2026Updated 2 weeks ago
- β141Dec 19, 2025Updated 5 months ago
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]β24Mar 17, 2025Updated last year
- Offical respority for Gait Recogniton with Drones: A benchmark (TMM 2023)β10Feb 2, 2024Updated 2 years ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.β119Aug 27, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ICML 2025 Papers: Dive into cutting-edge research from the premier machine learning conference. Stay current with breakthroughs in deep lβ¦β38Oct 24, 2025Updated 7 months ago
- [π³Software] A poweful C++ Plant Generation System by using modular procedural graphs. This is a free version with most general features.β¦β125Jan 29, 2026Updated 4 months ago
- εΊδΊε½ζ°εΌηΌη¨ε dio ε°θ£ ηη±»δΌΌ ahooks η useRequest η½η»θ―·ζ±εΊβ82Mar 5, 2026Updated 3 months ago
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learningβ44Apr 13, 2026Updated 2 months ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customizationβ21Jul 13, 2025Updated 11 months ago
- D-JEPA on ImageNetβ23Nov 18, 2024Updated last year
- Official implementation of "MV-TAP: Tracking Any Point in Multi-View Videos"