UMass-Embodied-AGI / MindJourneyLinks
Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"
☆30Updated this week
Alternatives and similar repositories for MindJourney
Users that are interested in MindJourney are comparing it to the libraries listed below
Sorting:
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆135Updated last month
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆44Updated 9 months ago
- ☆23Updated 3 months ago
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆41Updated last week
- Code implementation for: From Virtual Games to Real-World Play☆35Updated 3 weeks ago
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆44Updated last year
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆133Updated 3 weeks ago
- This is the project page of ShowRoom3D☆25Updated last year
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆36Updated 5 months ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token"☆34Updated last month
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆39Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆39Updated 7 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆118Updated 2 weeks ago
- [CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation☆46Updated last year
- ☆84Updated last month
- GECO: Generative Image-to-3D within a SECOnd☆65Updated 9 months ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆17Updated 5 months ago
- ☆131Updated 6 months ago
- PhysX: Physical-Grounded 3D Asset Generation☆113Updated this week
- ☆40Updated 11 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆71Updated 3 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆74Updated 2 weeks ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆81Updated this week
- The official implementation of "Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)☆101Updated 6 months ago
- ☆29Updated 2 months ago
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆64Updated 9 months ago
- [CVPR 2025 Highlight] MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation☆48Updated 2 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆60Updated 9 months ago
- Official repo for StyleMe3D☆24Updated 2 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆114Updated last month