MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in the metaverse, AR/VR, and game development.
☆202 May 5, 2025 Updated 9 months ago
Alternatives and similar repositories for MetaSpatial
Users who are interested in MetaSpatial are comparing it to the repositories listed below
- ☆23 Jan 9, 2026 Updated last month
- Reconstructing spatiotemporal dynamics from spatial transcriptome snapshots ☆34 Jun 26, 2025 Updated 8 months ago
- [ICLR 2025] The official implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit… ☆63 Feb 12, 2025 Updated last year
- Official repo and evaluation implementation of VSI-Bench ☆673 Aug 5, 2025 Updated 6 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation ☆83 Nov 29, 2025 Updated 3 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World ☆373 Oct 21, 2025 Updated 4 months ago
- Official code for DeepSound-V1 ☆13 May 14, 2025 Updated 9 months ago
- This repository contains the code for our ICML 2025 paper, LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉 ☆26 May 29, 2025 Updated 9 months ago
- We have released the official implementation at https://github.com/VAST-AI-Research/MIDI-3D ☆127 Mar 12, 2025 Updated 11 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench ☆35 Aug 12, 2025 Updated 6 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence ☆438 Feb 5, 2026 Updated 3 weeks ago
- Embodied Reasoning Question Answer (ERQA) Benchmark ☆261 Mar 12, 2025 Updated 11 months ago
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning ☆30 Jun 6, 2025 Updated 8 months ago
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025) ☆162 Jun 18, 2025 Updated 8 months ago
- A paper list for spatial reasoning ☆661 Jan 19, 2026 Updated last month
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning… ☆28 Sep 7, 2025 Updated 5 months ago
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025) ☆199 May 20, 2025 Updated 9 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation ☆17 Apr 3, 2024 Updated last year
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling ☆4,245 Sep 26, 2025 Updated 5 months ago
- Create your own 3D scene with words anywhere. ☆29 Updated this week
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio… ☆78 Jan 5, 2026 Updated last month
- Orient Anything, ICML 2025 ☆374 Feb 6, 2026 Updated 3 weeks ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency ☆60 Jun 6, 2025 Updated 8 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries ☆304 Oct 12, 2025 Updated 4 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing. ☆62 Dec 9, 2025 Updated 2 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding ☆398 Dec 22, 2025 Updated 2 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts. ☆226 Oct 17, 2025 Updated 4 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs ☆59 Jan 23, 2025 Updated last year
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing ☆90 Jul 27, 2025 Updated 7 months ago
- [ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI ☆645 Jan 12, 2026 Updated last month
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs. ☆198 Mar 16, 2024 Updated last year
- Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View, 3DV2025 ☆190 Mar 21, 2025 Updated 11 months ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control ☆97 May 13, 2025 Updated 9 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models." ☆335 Sep 14, 2025 Updated 5 months ago
- [ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation ☆299 Dec 22, 2024 Updated last year
- Embodied Intelligence in Endovascular Robot Navigation ☆100 Jan 8, 2026 Updated last month
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent" ☆132 Nov 13, 2025 Updated 3 months ago
- Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video] ☆831 Dec 14, 2025 Updated 2 months ago
- Spatial Aptitude Training for Multimodal Language Models ☆24 Feb 8, 2026 Updated 3 weeks ago