intelligolabs / R2RIE-CELinks
Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We present the first dataset - R2R-IE-CE - to benchmark instructions errors in VLN. We then propose a method, IEDL.
☆16Updated 9 months ago
Alternatives and similar repositories for R2RIE-CE
Users that are interested in R2RIE-CE are comparing it to the libraries listed below
Sorting:
- Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models☆53Updated last year
- Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23).☆97Updated last year
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆58Updated 7 months ago
- [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-S…☆77Updated 4 months ago
- ☆23Updated last year
- ☆45Updated 2 months ago
- Open Vocabulary Object Navigation☆92Updated 5 months ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆38Updated 3 months ago
- Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)☆48Updated 6 months ago
- ☆36Updated last year
- Code and Data for Paper: Boosting Efficient Reinforcement Learning for Vision-and-Language Navigation With Open-Sourced LLM☆15Updated 8 months ago
- official implementation for ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"☆42Updated 4 months ago
- Awesome habitat top down map work 🤩☆31Updated last year
- ☆108Updated last year
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆72Updated 7 months ago
- This is the source code to paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”.☆19Updated 2 months ago
- ☆44Updated 2 years ago
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation☆115Updated 3 months ago
- [AAAI 2025] Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration☆20Updated 10 months ago
- Official implementation of "NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM" (ACL'25 …☆46Updated 7 months ago
- Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", …☆119Updated 11 months ago
- ☆18Updated 7 months ago
- Official code release for "Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning"☆56Updated 2 years ago
- A new zero-shot framework to explore and search for the language descriptive targets in unknown environment based on Large Vision Languag…☆45Updated 10 months ago
- [CVPR 2023] We propose a framework for the challenging 3D-aware ObjectNav based on two straightforward sub-policies. The two sub-polices,…☆78Updated last year
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts(IJCAI 2024)☆14Updated last year
- Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation☆61Updated 9 months ago
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆14Updated last year
- https://xgxvisnav.github.io/☆21Updated last year
- Code for OctoNav-R1☆57Updated 4 months ago