NVlabs / RoboSpatial
☆30Updated last week
Alternatives and similar repositories for RoboSpatial:
Users that are interested in RoboSpatial are comparing it to the libraries listed below
- ☆49Updated 7 months ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆101Updated last week
- Unifying 2D and 3D Vision-Language Understanding☆79Updated 3 weeks ago
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING☆30Updated 9 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆53Updated 9 months ago
- ☆16Updated 10 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆52Updated 2 weeks ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆139Updated 2 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆71Updated 7 months ago
- List of papers on video-centric robot learning☆19Updated 5 months ago
- ☆59Updated 4 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆27Updated 2 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆75Updated 9 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆110Updated 7 months ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆147Updated last month
- Official Repository of SAM2Act☆88Updated last month
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆38Updated 9 months ago
- ☆46Updated 4 months ago
- [CoRL2023] Open-Vocabulary Scene-Graph☆66Updated last year
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆34Updated 9 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆131Updated last year
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆37Updated 5 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆104Updated last month
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.☆88Updated 11 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆62Updated 2 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆115Updated 2 weeks ago
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆41Updated last month
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆60Updated this week
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆77Updated 6 months ago
- [IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds☆68Updated 8 months ago