Video-Reason/Awesome-Video-Reasoning

This is a collection of recent papers on reasoning in video generation models.

☆165

Alternatives and similar repositories for Awesome-Video-Reasoning

Users who are interested in Awesome-Video-Reasoning are comparing it to the repositories listed below.

LJungang / Awesome-Video-Reasoning-Landscape
🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
☆190 · Jun 14, 2026 · Updated last month
thuml / MiniVeo3-Reasoner
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…
☆231 · Apr 13, 2026 · Updated 3 months ago
OpenSenseNova / Demystifying_Video_Reasoning
[ECCV 2026] Demystifying Video Reasoning
☆46 · Jul 14, 2026 · Updated 2 weeks ago
FoundationAgents / VR-Bench
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…
☆66 · Feb 4, 2026 · Updated 5 months ago
Eyeline-Labs / VChain
[ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
☆120 · Apr 8, 2026 · Updated 3 months ago
tongjingqi / Thinking-with-Video
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆315 · Jun 21, 2026 · Updated last month
zjuruizhechen / Awesome-Video-Agent
A collection of awesome think with videos papers.
☆100 · Dec 1, 2025 · Updated 7 months ago
yunlong10 / Awesome-Video-LMM-Post-Training
🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training
☆296 · Mar 3, 2026 · Updated 4 months ago
ZiyuGuo99 / MME-CoF
Are Video Models Ready as Zero-shot Reasoners?
☆87 · Nov 24, 2025 · Updated 8 months ago
Video-Reason / VBVR-Wan2.2
Official training and inference code for VBVR (A Very Big Video Reasoning Suite)
☆27 · Apr 9, 2026 · Updated 3 months ago
LaVi-Lab / Rethink_CoT_Video
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21 · Dec 14, 2025 · Updated 7 months ago
kinam0252 / TIC-FT
☆52 · Jan 6, 2026 · Updated 6 months ago
VisionChengzhuo / CoF-T2I
Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.
☆39 · Jan 16, 2026 · Updated 6 months ago
ziqihuangg / Awesome-From-Video-Generation-to-World-Model
A list of works on video generation towards world model
☆504 · Mar 21, 2026 · Updated 4 months ago
jialuli-luka / Video-MSG
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28 · Apr 14, 2025 · Updated last year
marinero4972 / Open-o3-Video
[ICML 2026] Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"
☆158 · May 1, 2026 · Updated 2 months ago
PKU-YuanGroup / FlashI2V
An official implementation of FlashI2V.
☆33 · Nov 16, 2025 · Updated 8 months ago
GVCLab / Sci-Fi
Sci-Fi: Symmetric Constraint for Frame Inbetweening
☆20 · Aug 12, 2025 · Updated 11 months ago
thu-ml / Causal-Forcing
[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…
☆888 · Jul 23, 2026 · Updated last week
KlingAIResearch / MultiShotMaster
CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"
☆171 · Feb 22, 2026 · Updated 5 months ago
knightyxp / VideoCoF
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
☆205 · Jun 17, 2026 · Updated last month
VainF / In-Video-Instructions
[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45 · Nov 25, 2025 · Updated 8 months ago
KlingAIResearch / MemFlow
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
☆216 · Dec 29, 2025 · Updated 7 months ago
mll-lab-nu / Awesome-Spatial-Intelligence-in-VLM
A paper list for spatial reasoning
☆767 · Jan 19, 2026 · Updated 6 months ago
mll-lab-nu / MindCube
☆164 · Mar 23, 2026 · Updated 4 months ago
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆295 · Jan 24, 2026 · Updated 6 months ago
QuenithAI / Diffusion-Large-Language-Models-Paper-List
Tracking the latest and greatest research papers on diffusion large language models.
☆32 · Mar 13, 2026 · Updated 4 months ago
choi403 / ALG
Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)
☆59 · Feb 23, 2026 · Updated 5 months ago
Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…
☆1,438 · May 11, 2026 · Updated 2 months ago
shengshu-ai / minWM
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
☆749 · Jun 15, 2026 · Updated last month
zhaochen0110 / Awesome_Think_With_Images
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,499 · Mar 9, 2026 · Updated 4 months ago
zjr2000 / REVERIE
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
☆20 · Jul 17, 2024 · Updated 2 years ago
cambrian-mllm / cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
☆563 · Apr 3, 2026 · Updated 3 months ago
yangluo7 / V-ReasonBench
☆36 · Feb 18, 2026 · Updated 5 months ago
meituan-longcat / WBench
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation
☆174 · Updated this week
EvolvingLMMs-Lab / ParaVT
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
☆54 · Jun 2, 2026 · Updated last month
kobeshegu / DiverseDiT
[CVPR-2026] DiverseDiT: Towards Diverse Representation Learning in Diffusion Transformers
☆20 · Mar 26, 2026 · Updated 4 months ago
Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,714 · Mar 23, 2026 · Updated 4 months ago
CIntellifusion / VideoDPO
Official Implementation of VideoDPO
☆169 · Jun 1, 2025 · Updated last year