code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)
☆10 · Jul 17, 2022 · Updated 3 years ago
Alternatives and similar repositories for ADAPT
Users who are interested in ADAPT are comparing it to the repositories listed below
- ☆14 · Sep 21, 2022 · Updated 3 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021) ☆10 · Jul 15, 2022 · Updated 3 years ago
- ☆16 · Jun 12, 2024 · Updated last year
- ☆23 · Mar 9, 2023 · Updated 2 years ago
- [ACL2023] Official code repository for VLN-Trans ☆14 · Sep 10, 2023 · Updated 2 years ago
- Official Repository for the ACM MM 2024 paper "Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments" ☆15 · May 16, 2025 · Updated 9 months ago
- [IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We… ☆18 · Jan 8, 2025 · Updated last year
- ☆33 · Sep 25, 2024 · Updated last year
- code for TIDEE: Novel Room Reorganization using Visuo-Semantic Common Sense Priors ☆40 · Nov 21, 2023 · Updated 2 years ago
- ☆33 · Aug 19, 2023 · Updated 2 years ago
- Code for CVPR22 paper One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones ☆13 · Jul 27, 2022 · Updated 3 years ago
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23) ☆45 · Aug 6, 2024 · Updated last year
- Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation ☆80 · May 31, 2023 · Updated 2 years ago
- Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation ☆16 · Feb 7, 2022 · Updated 4 years ago
- VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs ☆48 · Jan 5, 2026 · Updated last month
- ☆23 · Dec 9, 2021 · Updated 4 years ago
- ☆22 · Oct 16, 2025 · Updated 4 months ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral) ☆20 · Oct 21, 2023 · Updated 2 years ago
- code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks" ☆23 · Mar 23, 2021 · Updated 4 years ago
- ☆55 · Apr 1, 2022 · Updated 3 years ago
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La… ☆28 · Mar 4, 2022 · Updated 3 years ago
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting ☆29 · Dec 16, 2024 · Updated last year
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression" ☆26 · May 22, 2024 · Updated last year
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation" ☆88 · Jun 27, 2024 · Updated last year
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022) ☆30 · Aug 2, 2022 · Updated 3 years ago
- Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23). ☆102 · Apr 18, 2024 · Updated last year
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins ☆12 · Sep 20, 2024 · Updated last year
- A Maximal Mutual Information Criterion for Manipulation Concept Discovery ☆13 · Sep 26, 2024 · Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22). ☆43 · Mar 16, 2023 · Updated 2 years ago
- Pushing it out of the Way: Interactive Visual Navigation ☆44 · Jan 26, 2024 · Updated 2 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra… ☆93 · Jul 11, 2023 · Updated 2 years ago
- ☆38 · Mar 10, 2022 · Updated 3 years ago
- Visual Question Generation ☆11 · Aug 20, 2024 · Updated last year
- OpenPose CNN model compatible with Huawei Ascend Atlas 200DK ☆10 · Jun 23, 2019 · Updated 6 years ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations" ☆14 · May 29, 2024 · Updated last year
- [AAAI 2024] Weakly Supervised Multimodal Affordance Grounding for Egocentric Images ☆13 · Nov 10, 2024 · Updated last year
- Human-centric environment representations from egocentric video ☆14 · Feb 5, 2026 · Updated 3 weeks ago
- ☆15 · Mar 18, 2025 · Updated 11 months ago
- ☆10 · Dec 6, 2019 · Updated 6 years ago