[ICLR 2026] MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in the metaverse, AR/VR, and game development.
☆207May 5, 2025Updated last year
Alternatives and similar repositories for MetaSpatial
Users that are interested in MetaSpatial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jan 9, 2026Updated 4 months ago
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆64Feb 12, 2025Updated last year
- Reconstructing spatiotemporal dynamics from spatial transcriptome snapshots☆62Apr 12, 2026Updated last month
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 11 months ago
- Official code for DeepSound-V1☆12May 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆36Aug 12, 2025Updated 9 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆89Nov 29, 2025Updated 5 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆30Sep 7, 2025Updated 8 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.☆62Dec 9, 2025Updated 5 months ago
- Official repo and evaluation implementation of VSI-Bench☆708Aug 5, 2025Updated 9 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆378Oct 21, 2025Updated 6 months ago
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆33Mar 15, 2026Updated 2 months ago
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025)☆172Jun 18, 2025Updated 11 months ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆128Mar 12, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A paper list for spatial reasoning☆742Jan 19, 2026Updated 4 months ago
- Embodied Reasoning Question Answer (ERQA) Benchmark☆273Mar 12, 2025Updated last year
- [NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆464Feb 5, 2026Updated 3 months ago
- Embodied Intelligence in Endovascular Robot Navigation -- 血管介入手术机器人具身导航☆45Jan 8, 2026Updated 4 months ago
- [🏆AAAI'25] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.☆82Apr 14, 2026Updated last month
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling☆4,552Sep 26, 2025Updated 7 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆61Jan 23, 2025Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆318Oct 12, 2025Updated 7 months ago
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆202May 20, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 11 months ago
- CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.☆548Apr 2, 2025Updated last year
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆86Jan 5, 2026Updated 4 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆409Dec 22, 2025Updated 4 months ago
- ☆192Jul 25, 2025Updated 9 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated 2 years ago
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆81Sep 12, 2025Updated 8 months ago
- ☆95Mar 19, 2025Updated last year
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆201Mar 16, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Orient Anything, ICML 2025☆381Feb 6, 2026Updated 3 months ago
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆96Jul 27, 2025Updated 9 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆344Sep 14, 2025Updated 8 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆247Updated this week
- Explore the Multimodal “Aha Moment” on 2B Model☆623Mar 18, 2025Updated last year
- [ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI☆669Jan 12, 2026Updated 4 months ago
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent"☆154Nov 13, 2025Updated 6 months ago