NVlabs / RoboSpatialLinks
☆89Updated 2 months ago
Alternatives and similar repositories for RoboSpatial
Users that are interested in RoboSpatial are comparing it to the libraries listed below
Sorting:
- ☆71Updated 7 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆117Updated 10 months ago
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"☆167Updated 2 months ago
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation☆128Updated last year
- ☆54Updated 7 months ago
- ☆71Updated last week
- Official Repository for SAM2Act☆110Updated this week
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆110Updated 3 months ago
- ✨✨Official implementation of BridgeVLA☆118Updated last month
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆154Updated 2 months ago
- ☆24Updated 3 months ago
- InternRobotics' open platform for building generalized navigation foundation models.☆160Updated this week
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆55Updated 5 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆41Updated 2 months ago
- ☆42Updated last year
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆83Updated 10 months ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆188Updated last month
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆90Updated last month
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆182Updated last month
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.☆100Updated last year
- PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators☆90Updated 9 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆67Updated 5 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆99Updated 4 months ago
- ☆92Updated last month
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation☆112Updated 10 months ago
- ☆105Updated last year
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆177Updated 2 months ago
- Open-source implementations on real robots☆34Updated 9 months ago
- ☆113Updated 11 months ago
- Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"☆192Updated last week