HaochenZ11 / VLA-3DLinks
☆90Updated 10 months ago
Alternatives and similar repositories for VLA-3D
Users that are interested in VLA-3D are comparing it to the libraries listed below
Sorting:
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆118Updated 7 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆73Updated 8 months ago
- ☆47Updated last month
- ☆114Updated last month
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"☆188Updated last month
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆65Updated 8 months ago
- code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆37Updated last month
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆144Updated last month
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆43Updated 4 months ago
- Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)☆51Updated 7 months ago
- [RAL-25] An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with ROS support.☆110Updated last week
- ☆78Updated 2 months ago
- [CoRL2023] Open-Vocabulary Scene-Graph☆70Updated last year
- [CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos☆151Updated last month
- [ISER 2023] The official implementation of Audio Visual Language Maps for Robot Navigation☆59Updated last year
- ☆50Updated 3 weeks ago
- Open Vocabulary Object Navigation☆99Updated 6 months ago
- [AAAI 25] The official implementation of Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation☆39Updated 8 months ago
- ☆111Updated last year
- ☆208Updated 3 months ago
- Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation. Project website: http://moma-llm.cs.uni-fr…☆95Updated last year
- [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-S…☆83Updated 5 months ago
- ☆51Updated 3 months ago
- Code for OctoNav-R1☆60Updated 4 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆43Updated 4 months ago
- Official repository for LeLaN training and inference code☆123Updated last year
- Official Code for "From Cognition to Precognition: A Future-Aware Framework for Social Navigation" (ICRA 2025)☆89Updated last month
- Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models☆53Updated last year
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde…☆51Updated last month
- [RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.☆151Updated last month