BAAI-DCAI / SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
☆219Updated last month
Alternatives and similar repositories for SpatialBot:
Users that are interested in SpatialBot are comparing it to the libraries listed below
- ☆299Updated last month
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆94Updated 2 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆191Updated last month
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆235Updated 10 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆417Updated 2 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆182Updated last week
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆94Updated last month
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆115Updated last month
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆450Updated 4 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆118Updated 8 months ago
- The Official Implementation of RoboMatrix☆83Updated 2 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆82Updated last month
- Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"☆154Updated 4 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆127Updated 4 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆174Updated last week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆215Updated 4 months ago
- ☆62Updated last month
- ☆50Updated last month
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆142Updated 3 months ago
- [arXiv 2023] Embodied Task Planning with Large Language Models☆170Updated last year
- Code for RoboFlamingo☆358Updated 10 months ago
- ☆156Updated 3 weeks ago