thuml/MiniVeo3-Reasoner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thuml/MiniVeo3-Reasoner)

thuml / MiniVeo3-Reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

☆230

Alternatives and similar repositories for MiniVeo3-Reasoner

Users that are interested in MiniVeo3-Reasoner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thuml / Reasoning-Visual-World
View on GitHub
Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…
☆100Mar 9, 2026Updated 4 months ago
Video-Reason / Awesome-Video-Reasoning
View on GitHub
This is a collection of recent papers on reasoning in video generation models.
☆165Updated this week
Eyeline-Labs / VChain
View on GitHub
[ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
☆120Apr 8, 2026Updated 3 months ago
cambrian-mllm / cambrian-s
View on GitHub
Cambrian-S: Towards Spatial Supersensing in Video
☆563Apr 3, 2026Updated 3 months ago
baaivision / Emu3.5
View on GitHub
Native Multimodal Models are World Learners
☆1,538Dec 30, 2025Updated 6 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
tongjingqi / Thinking-with-Video
View on GitHub
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆315Jun 21, 2026Updated last month
FoundationAgents / VR-Bench
View on GitHub
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…
☆66Feb 4, 2026Updated 5 months ago
shengshu-ai / minWM
View on GitHub
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
☆740Jun 15, 2026Updated last month
ZiyuGuo99 / MME-CoF
View on GitHub
Are Video Models Ready as Zero-shot Reasoners?
☆87Nov 24, 2025Updated 8 months ago
TencentARC / RollingForcing
View on GitHub
[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
☆445Oct 31, 2025Updated 8 months ago
SOTAMak1r / Infinite-Forcing
View on GitHub
Infinite-Forcing: Towards Infinite-Long Video Generation
☆155Nov 13, 2025Updated 8 months ago
stdstu12 / YUME
View on GitHub
The official code of Yume
☆679Jan 14, 2026Updated 6 months ago
ByteDance-Seed / Seed-1.8
View on GitHub
☆219Dec 19, 2025Updated 7 months ago
JaydenLyh / Reward-Forcing
View on GitHub
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352Dec 15, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cheryyunl / ROVER
View on GitHub
Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
☆26Dec 12, 2025Updated 7 months ago
LJungang / Awesome-Video-Reasoning-Landscape
View on GitHub
🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
☆189Jun 14, 2026Updated last month
yangzhou24 / OmniWorld
View on GitHub
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485Apr 16, 2026Updated 3 months ago
Robbyant / lingbot-video
View on GitHub
Scaling Mixture-of-Experts Video Pretraining for Embodied Intelligence
☆871Jul 10, 2026Updated 2 weeks ago
para-lost / ECHO
View on GitHub
Echo: "Constantly Improving Image Models Need Constantly Improving Benchmarks" (ICLR 2026)
☆20Jan 29, 2026Updated 5 months ago
HorizonWind2004 / reconstruction-alignment
View on GitHub
[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…
☆411May 23, 2026Updated 2 months ago
thu-ml / Causal-Forcing
View on GitHub
[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…
☆880Updated this week
NVlabs / DiffusionNFT
View on GitHub
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆985Feb 10, 2026Updated 5 months ago
ziqihuangg / Awesome-From-Video-Generation-to-World-Model
View on GitHub
A list of works on video generation towards world model
☆502Mar 21, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
solaris-wm / solaris
View on GitHub
The first multiplayer video world model in Minecraft
☆219Mar 3, 2026Updated 4 months ago
NVlabs / rcm
View on GitHub
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
☆772Jun 25, 2026Updated last month
THU-SI / Spatial-TTT
View on GitHub
[ECCV 2026] Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
☆241Jun 19, 2026Updated last month
zjuruizhechen / Awesome-Video-Agent
View on GitHub
A collection of awesome think with videos papers.
☆100Dec 1, 2025Updated 7 months ago
ThinkMorph / ThinkMorph
View on GitHub
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆192May 1, 2026Updated 2 months ago
Seed3D / Seed3D
View on GitHub
☆213Oct 22, 2025Updated 9 months ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,978Feb 25, 2026Updated 5 months ago
PKU-YuanGroup / UniSandBox
View on GitHub
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
☆60Nov 27, 2025Updated 8 months ago
NJU-3DV / SpatialVID
View on GitHub
[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
☆589Apr 22, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenDCAI / OpenWorldLib
View on GitHub
Unified Codebase for Advanced World Models.
☆846Updated this week
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,880Updated this week
SunYangtian / UniGeo
View on GitHub
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
☆136Jun 10, 2025Updated last year
lcqysl / DiffThinker
View on GitHub
[ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"
☆186Jan 4, 2026Updated 6 months ago
PKU-YuanGroup / UAE
View on GitHub
Official repository for the UAE paper, unified-GRPO, and unified-Bench
☆165Sep 12, 2025Updated 10 months ago
NVlabs / LongLive
View on GitHub
Long Video Gen Infrastructure
☆2,493Jul 15, 2026Updated last week
KlingAIResearch / VideoCanvas
View on GitHub
Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"
☆68Oct 10, 2025Updated 9 months ago