MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in the metaverse, AR/VR, and game development.
☆205May 5, 2025Updated 10 months ago
Alternatives and similar repositories for MetaSpatial
Users that are interested in MetaSpatial are comparing it to the libraries listed below
Sorting:
- ☆23Jan 9, 2026Updated 2 months ago
- Reconstructing spatiotemporal dynamics from spatial transcriptome snapshots☆45Updated this week
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆63Feb 12, 2025Updated last year
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 9 months ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 10 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆35Aug 12, 2025Updated 7 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆87Nov 29, 2025Updated 3 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆29Sep 7, 2025Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.☆62Dec 9, 2025Updated 3 months ago
- Official repo and evaluation implementation of VSI-Bench☆682Aug 5, 2025Updated 7 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆374Oct 21, 2025Updated 5 months ago
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆30Jun 6, 2025Updated 9 months ago
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025)☆163Jun 18, 2025Updated 9 months ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆127Mar 12, 2025Updated last year
- A paper list for spatial reasoning☆683Jan 19, 2026Updated 2 months ago
- Embodied Reasoning Question Answer (ERQA) Benchmark☆263Mar 12, 2025Updated last year
- [🏆AAAI2025] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.☆70Feb 11, 2026Updated last month
- [NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆451Feb 5, 2026Updated last month
- Embodied Intelligence in Endovascular Robot Navigation -- 血管介入手术机器人具身导航☆100Jan 8, 2026Updated 2 months ago
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling☆4,283Sep 26, 2025Updated 5 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆59Jan 23, 2025Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆310Oct 12, 2025Updated 5 months ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆81Jan 5, 2026Updated 2 months ago
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆200May 20, 2025Updated 10 months ago
- CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.☆528Apr 2, 2025Updated 11 months ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 9 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆398Dec 22, 2025Updated 3 months ago
- Create your own 3D scene with words anywhere.☆34Updated this week
- ☆186Jul 25, 2025Updated 7 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated last year
- ☆95Mar 19, 2025Updated last year
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆199Mar 16, 2024Updated 2 years ago
- Orient Anything, ICML 2025☆376Feb 6, 2026Updated last month
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆92Jul 27, 2025Updated 7 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆231Oct 17, 2025Updated 5 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆338Sep 14, 2025Updated 6 months ago
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent"☆138Nov 13, 2025Updated 4 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆624Mar 18, 2025Updated last year
- [ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI☆653Jan 12, 2026Updated 2 months ago