[IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We present the first dataset, R2R-IE-CE, to benchmark instruction errors in VLN. We then propose a method, IEDL.
☆18 Jan 8, 2025 Updated last year
Alternatives and similar repositories for R2RIE-CE
Users who are interested in R2RIE-CE are comparing it to the repositories listed below
- ☆23 Mar 9, 2023 Updated 2 years ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne… ☆49 Dec 20, 2024 Updated last year
- Awesome habitat top down map work 🤩 ☆35 Apr 7, 2024 Updated last year
- ☆17 Jan 19, 2026 Updated last month
- Code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022) ☆10 Jul 17, 2022 Updated 3 years ago
- Official code for IROS 2025 paper "TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Ver… ☆16 Dec 27, 2025 Updated 2 months ago
- Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23). ☆102 Apr 18, 2024 Updated last year
- Official Repository for the ACM MM 2024 paper "Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments" ☆15 May 16, 2025 Updated 9 months ago
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H… ☆104 Apr 2, 2025 Updated 11 months ago
- ☆18 Mar 12, 2025 Updated 11 months ago
- Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024) ☆16 Jun 7, 2024 Updated last year
- [ICCV 2025] MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation ☆51 Oct 14, 2025 Updated 4 months ago
- [AAAI 2025] Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration ☆25 Dec 13, 2024 Updated last year
- This is the source code for the paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”. ☆30 Aug 13, 2025 Updated 6 months ago
- Towards Collaborative Semantic Visual Navigation via Vision Language Models ☆34 Jul 3, 2025 Updated 8 months ago
- [ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language ☆17 Dec 3, 2024 Updated last year
- Official implementation for the ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation" ☆45 Jun 6, 2025 Updated 8 months ago
- SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World ☆147 Nov 4, 2024 Updated last year
- Code for the paper "Towards More Generalizable One-Shot Visual Imitation Learning", ICRA 2022 ☆20 May 5, 2022 Updated 3 years ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation ☆31 Feb 23, 2026 Updated last week
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24). ☆45 Jun 19, 2025 Updated 8 months ago
- ☆30 Sep 1, 2025 Updated 6 months ago
- [ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation ☆211 Jul 2, 2025 Updated 8 months ago
- [ICRA 25] InsCMPR: Efficient Cross-Modal Place Recognition via Instance-Aware Hybrid Mamba-Transformer ☆27 Sep 29, 2025 Updated 5 months ago
- DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments ☆24 Apr 8, 2025 Updated 10 months ago
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation ☆146 Oct 24, 2025 Updated 4 months ago
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments" ☆423 Apr 5, 2025 Updated 11 months ago
- ☆28 Jun 24, 2024 Updated last year
- Official code for the long-horizon language-conditioned robotic manipulation benchmark LoHoRavens. ☆22 Oct 8, 2024 Updated last year
- Reproduction of MPC-D-CBF ☆26 Oct 9, 2024 Updated last year
- [ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts ☆35 Dec 17, 2025 Updated 2 months ago
- ☆28 Aug 31, 2023 Updated 2 years ago
- Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models ☆59 Sep 17, 2024 Updated last year
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation ☆191 Nov 11, 2024 Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting ☆29 Dec 16, 2024 Updated last year
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025) ☆38 May 25, 2025 Updated 9 months ago
- [AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments" ☆69 Nov 2, 2025 Updated 4 months ago
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation ☆69 Mar 17, 2025 Updated 11 months ago
- ☆80 Apr 24, 2025 Updated 10 months ago