MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in the metaverse, AR/VR, and game development.
☆206May 5, 2025Updated 11 months ago
Alternatives and similar repositories for MetaSpatial
Users that are interested in MetaSpatial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jan 9, 2026Updated 3 months ago
- Reconstructing spatiotemporal dynamics from spatial transcriptome snapshots☆57Mar 17, 2026Updated 3 weeks ago
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆64Feb 12, 2025Updated last year
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 10 months ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆36Aug 12, 2025Updated 7 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆88Nov 29, 2025Updated 4 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆29Sep 7, 2025Updated 7 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.☆61Dec 9, 2025Updated 4 months ago
- Official repo and evaluation implementation of VSI-Bench☆694Aug 5, 2025Updated 8 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆376Oct 21, 2025Updated 5 months ago
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆31Mar 15, 2026Updated 3 weeks ago
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025)☆165Jun 18, 2025Updated 9 months ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆128Mar 12, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A paper list for spatial reasoning☆706Jan 19, 2026Updated 2 months ago
- Embodied Reasoning Question Answer (ERQA) Benchmark☆266Mar 12, 2025Updated last year
- [NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆457Feb 5, 2026Updated 2 months ago
- [🏆AAAI'25] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.☆71Feb 11, 2026Updated last month
- Embodied Intelligence in Endovascular Robot Navigation -- 血管介入手术机器人具身导航☆80Jan 8, 2026Updated 3 months ago
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling☆4,494Sep 26, 2025Updated 6 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆60Jan 23, 2025Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆316Oct 12, 2025Updated 5 months ago
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆201May 20, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 10 months ago
- CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.☆537Apr 2, 2025Updated last year
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆84Jan 5, 2026Updated 3 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆401Dec 22, 2025Updated 3 months ago
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆79Sep 12, 2025Updated 6 months ago
- ☆187Jul 25, 2025Updated 8 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated 2 years ago
- [CVPR 2026] M3DLayout-A-Multi-Source-Dataset-of-3D-Indoor-Layouts-and-Structured-Descriptions.☆47Apr 1, 2026Updated last week
- ☆95Mar 19, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆200Mar 16, 2024Updated 2 years ago
- Orient Anything, ICML 2025☆377Feb 6, 2026Updated 2 months ago
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆94Jul 27, 2025Updated 8 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆340Sep 14, 2025Updated 6 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆236Oct 17, 2025Updated 5 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆622Mar 18, 2025Updated last year
- [ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI☆660Jan 12, 2026Updated 2 months ago