UMass-Embodied-AGI/MindJourney

[NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"

☆151

Alternatives and similar repositories for MindJourney

Users who are interested in MindJourney are comparing it to the libraries listed below.

mengcaopku / SpatialDreamer
SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery
☆15 · Feb 1, 2026 · Updated 5 months ago
LaVi-Lab / VG-LLM
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆248 · Nov 28, 2025 · Updated 8 months ago
VITA-Group / VLM-3R
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆431 · Jul 15, 2026 · Updated 2 weeks ago
Yui010206 / Adaptive-Visual-Imagination-Control
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18 · Jun 2, 2026 · Updated last month
vision-x-nyu / thinking-in-space
Official repo and evaluation implementation of VSI-Bench
☆734 · Aug 5, 2025 · Updated 11 months ago
InternRobotics / MMSI-Bench
[ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
☆106 · Apr 28, 2026 · Updated 3 months ago
UMass-Embodied-AGI / 3D-Mem
[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"
☆269 · Oct 2, 2025 · Updated 9 months ago
THU-SI / Spatial-MLLM
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆480 · Feb 5, 2026 · Updated 5 months ago
arijitray1993 / SAT
Spatial Aptitude Training for Multimodal Language Models
☆33 · Feb 8, 2026 · Updated 5 months ago
facebookresearch / nwm
Official code for the CVPR 2025 paper "Navigation World Models".
☆658 · Nov 24, 2025 · Updated 8 months ago
Mars-tin / fast-spatial-mem
Fast Spatial Memory with Elastic Test-Time Training (4D-LRM + 4D-LVSM)
☆103 · Jun 20, 2026 · Updated last month
qizekun / OmniSpatial
[ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
☆89 · Jan 21, 2026 · Updated 6 months ago
THU-SI / LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
☆302 · Jul 15, 2025 · Updated last year
UMass-Embodied-AGI / TesserAct
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆403 · Aug 4, 2025 · Updated 11 months ago
InternRobotics / G2VLM
[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
☆346 · Apr 18, 2026 · Updated 3 months ago
mll-lab-nu / MindCube
☆164 · Mar 23, 2026 · Updated 4 months ago
spacetools / SpaceTools
code release
☆38 · Jun 22, 2026 · Updated last month
mll-lab-nu / Awesome-Spatial-Intelligence-in-VLM
A paper list for spatial reasoning
☆767 · Jan 19, 2026 · Updated 6 months ago
liziwennba / SURPRISE3D
☆22 · Apr 14, 2026 · Updated 3 months ago
InternRobotics / Aether
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆604 · Oct 26, 2025 · Updated 9 months ago
InternLM / Spatial-SSRL
[CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
☆133 · Apr 7, 2026 · Updated 3 months ago
cheolhong0916 / contrastive-probing
☆16 · Jun 19, 2026 · Updated last month
Visual-AI / 3DRS
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158 · Dec 9, 2025 · Updated 7 months ago
ESI-Bench / ESI-Bench
☆119 · Jul 19, 2026 · Updated last week
gca-spatial-reasoning / gca
Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"
☆91 · Apr 7, 2026 · Updated 3 months ago
zhangzaibin / spagent
SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.
☆209 · Updated this week
UMass-Embodied-AGI / Mirage
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆294 · Aug 2, 2025 · Updated 11 months ago
KAIST-Visual-AI-Group / APC-VLM
[ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
☆66 · Sep 12, 2025 · Updated 10 months ago
hustvl / Spa3R
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆51 · Mar 25, 2026 · Updated 4 months ago
GenSim2 / GenSim2
☆85 · May 9, 2026 · Updated 2 months ago
THU-SI / Spatial-TTT
[ECCV 2026] Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
☆243 · Jun 19, 2026 · Updated last month
NVlabs / RoboSpatial
☆147 · Jun 17, 2026 · Updated last month
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆320 · May 30, 2026 · Updated last month
MTU3D / MTU3D
☆266 · Aug 6, 2025 · Updated 11 months ago
Yangr116 / VST
[ECCV2026] Visual Spatial Tuning
☆200 · Mar 25, 2026 · Updated 4 months ago
vbdi / Ego3D-Bench
[ICLR2026] Spatial Reasoning with Vision-Language Models
☆60 · Jan 26, 2026 · Updated 6 months ago
zhangquanchen / 3DThinker
[CVPR 2026] Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
☆244 · May 7, 2026 · Updated 2 months ago
Zhoues / RoboRefer
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
☆265 · Dec 16, 2025 · Updated 7 months ago
cambrian-mllm / cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
☆563 · Apr 3, 2026 · Updated 3 months ago