NVlabs / RoboSpatial
☆48 Updated last month
Alternatives and similar repositories for RoboSpatial
Users who are interested in RoboSpatial are comparing it to the repositories listed below.
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆112 Updated 7 months ago
- ☆49 Updated 8 months ago
- Official Repository of SAM2Act ☆95 Updated 3 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆107 Updated 2 weeks ago
- ☆63 Updated 5 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper ☆57 Updated last month
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆78 Updated 7 months ago
- ☆10 Updated last month
- ☆60 Updated last week
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning" ☆59 Updated last week
- Unifying 2D and 3D Vision-Language Understanding ☆82 Updated last month
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24). ☆39 Updated 10 months ago
- ☆89 Updated 3 weeks ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24). ☆65 Updated 3 months ago
- ☆20 Updated last month
- ☆41 Updated last year
- List of papers on video-centric robot learning ☆20 Updated 6 months ago
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024) ☆36 Updated 10 months ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation ☆159 Updated 3 weeks ago
- Official code for the CVPR 2025 paper "Navigation World Models". ☆185 Updated last month
- ☆54 Updated 3 months ago
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper. ☆92 Updated last year
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning" ☆117 Updated last month
- A Vision-Language Model for Spatial Affordance Prediction in Robotics ☆157 Updated 3 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆54 Updated 10 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation ☆102 Updated last week
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ☆74 Updated 7 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation ☆229 Updated 2 months ago
- [IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds ☆69 Updated 9 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆38 Updated 5 months ago