DuNGEOnmassster/VideoGen-of-Thought

[NeurIPS 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention

☆63

Alternatives and similar repositories for VideoGen-of-Thought

Users who are interested in VideoGen-of-Thought are comparing it to the repositories listed below.

KyleHuang9 / SeFAR
[AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
☆30 · Jan 3, 2025 · Updated last year
GeekGuru123 / ProfilingDiT
☆20 · Jan 1, 2026 · Updated 6 months ago
LAW1223 / AlignVid
☆23 · May 29, 2026 · Updated last month
XianfengWu01 / LightGen
An Efficient Text-to-Image Generation Pretrain Pipeline
☆132 · Apr 18, 2025 · Updated last year
EnVision-Research / ScalingAR
[ICML 2026] ScalingAR: Scaling Confidence for Autoregressive Image Generation
☆22 · May 5, 2026 · Updated 2 months ago
DuNGEOnmassster / awesome-customized-generative-AI
A collection of papers and code for customized, personalized, and editable generative models
☆28 · Oct 1, 2024 · Updated last year
HaroldChen19 / VistaDPO
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆41 · Jun 14, 2025 · Updated last year
PKU-YuanGroup / Next-Patch-Prediction
[AAAI26] Next Patch Prediction
☆129 · Jan 2, 2025 · Updated last year
LAW1223 / OpenSubject
☆55 · Dec 10, 2025 · Updated 7 months ago
hustvl / 4DLangVGGT
Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”
☆89 · Mar 25, 2026 · Updated 3 months ago
Cheliosoops / BitQ
☆10 · Apr 24, 2024 · Updated 2 years ago
EnVision-Research / TiViBench
[CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
☆67 · Feb 21, 2026 · Updated 4 months ago
SHYuanBest / LHNet
Official PyTorch implementation of LHNet (ACM MM 2023)
☆14 · Feb 23, 2024 · Updated 2 years ago
WenjieShu / LoopViT
☆45 · Feb 4, 2026 · Updated 5 months ago
xgen-universe / Capybara
☆202 · Feb 27, 2026 · Updated 4 months ago
Tencent-Hunyuan / SAGE-GRPO
Official Implementation of SAGE-GRPO: Manifold-Aware Exploration for Reinforcement Learning in Video Generation
☆126 · Apr 2, 2026 · Updated 3 months ago
aiming-lab / MJ-Video
[NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
☆20 · Feb 23, 2025 · Updated last year
CIntellifusion / VideoDPO
Official Implementation of VideoDPO
☆169 · Jun 1, 2025 · Updated last year
GVCLab / Sci-Fi
Sci-Fi: Symmetric Constraint for Frame Inbetweening
☆20 · Aug 12, 2025 · Updated 11 months ago
BAAI-DCAI / MMVU
☆57 · Mar 19, 2025 · Updated last year
JosephTiTan / FreePCA
Code of the paper "FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…
☆27 · Apr 3, 2026 · Updated 3 months ago
VideoVerses / VideoTuna
Let's finetune video generation models!
☆551 · Sep 15, 2025 · Updated 10 months ago
microsoft / distilled_decoding
[ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
☆19 · Apr 21, 2025 · Updated last year
EnVision-Research / LatentMorph
[ICML 2026] LatentMorph: Morphing Latent Reasoning into Image Generation
☆47 · May 5, 2026 · Updated 2 months ago
longvideoagent / LongVideoAgent
☆120 · Apr 8, 2026 · Updated 3 months ago
ByteDance-Seed / VINCIE
Official code for VINCIE: Unlocking In-context Image Editing from Video
☆60 · Jun 19, 2026 · Updated last month
nexuslrf / composition_rendering
☆109 · Oct 17, 2025 · Updated 9 months ago
HKU-MMLab / Macro
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆66 · Mar 27, 2026 · Updated 3 months ago
QwenLM / Qwen-Image-Bench
☆128 · Jun 18, 2026 · Updated last month
jiaxinxie97 / HFGI3D
☆206 · Jun 14, 2024 · Updated 2 years ago
TencentARC / DiTCtrl
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…
☆323 · Mar 30, 2025 · Updated last year
showlab / MovieAgent
MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning
☆349 · Mar 26, 2025 · Updated last year
YingqingHe / Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
☆549 · Apr 4, 2025 · Updated last year
JethroJames / TUNED
[AAAI 2025] Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification
☆20 · Apr 17, 2025 · Updated last year
zhengdian1 / AIA
☆45 · Jan 4, 2026 · Updated 6 months ago
ypwang61 / StoryEval
[CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
☆20 · May 2, 2025 · Updated last year
Junchao-cs / LIVE
[ICML 2026] "LIVE: Long-horizon Interactive Video World ModEling"
☆35 · Updated this week
xyq7 / Human-Contribution-Measurement
☆13 · Jun 4, 2025 · Updated last year
tkpham3105 / TALE
[ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
☆20 · Jun 29, 2026 · Updated 3 weeks ago