[ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
☆115Oct 7, 2025Updated 4 months ago
Alternatives and similar repositories for VChain
Users that are interested in VChain are comparing it to the libraries listed below
Sorting:
- ☆27Oct 5, 2023Updated 2 years ago
- Toolbox for GTA-Human Datasets☆25Oct 9, 2024Updated last year
- Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"☆132Dec 18, 2025Updated 2 months ago
- ☆43Dec 1, 2025Updated 3 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆84Jul 10, 2025Updated 7 months ago
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens☆20Oct 12, 2025Updated 4 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- ☆21Feb 13, 2026Updated 2 weeks ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆51Jul 11, 2025Updated 7 months ago
- [ICLR 2026] LongLive: Real-time Interactive Long Video Generation☆1,077Feb 26, 2026Updated last week
- [ICLR 2026] Official Code for "the Quest for Generalizable Motion Generation: Data, Model, and Evaluation"☆76Feb 12, 2026Updated 3 weeks ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆38Jan 9, 2026Updated last month
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 6 months ago
- Block-Recurrent Dynamics in ViTs 🦖☆31Dec 24, 2025Updated 2 months ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Oct 5, 2025Updated 5 months ago
- ☆40Mar 3, 2024Updated 2 years ago
- Official Repo for Self-Forcing++ High Quality Long Video Generation☆237Oct 13, 2025Updated 4 months ago
- ☆56Dec 8, 2025Updated 2 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆50Feb 21, 2026Updated last week
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated 9 months ago
- [CVPR 2025] MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention☆40Mar 12, 2025Updated 11 months ago
- [NeurIPS 2023] PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation☆118Dec 8, 2023Updated 2 years ago
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 3 months ago
- Pytorch implementation for "Iterative Human and Automated Identification of Wildlife Images" (Nature -Machine Intelligence, 2021)☆19Nov 3, 2021Updated 4 years ago
- Toolbox for HuMMan Dataset☆126Dec 7, 2024Updated last year
- Are Video Models Ready as Zero-shot Reasoners?☆84Nov 24, 2025Updated 3 months ago
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆92Dec 1, 2025Updated 3 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆39Jul 22, 2025Updated 7 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆167Dec 11, 2025Updated 2 months ago
- ☆40Dec 16, 2025Updated 2 months ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆99Jan 1, 2026Updated 2 months ago
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆139Feb 1, 2026Updated last month
- [ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics☆121Jan 26, 2026Updated last month
- A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines☆32Updated this week
- VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs☆48Jan 5, 2026Updated 2 months ago
- GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors☆116Nov 6, 2025Updated 3 months ago
- ☆86Jan 2, 2024Updated 2 years ago
- [CVPR 2025 Highlight] MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation☆64May 9, 2025Updated 9 months ago