cshizhe / VLN-HAMT
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
☆99Updated last year
Related projects ⓘ
Alternatives and complementary repositories for VLN-HAMT
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆116Updated last year
- Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"☆29Updated last year
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆32Updated 2 years ago
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆24Updated 6 months ago
- Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation☆43Updated 3 years ago
- Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation☆153Updated 2 years ago
- REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments☆111Updated last year
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆28Updated last year
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…☆92Updated last year
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆35Updated 3 months ago
- ☆42Updated 2 years ago
- Official Implementation of IVLN-CE: Iterative Vision-and-Language Navigation in Continuous Environments☆28Updated 11 months ago
- ☆12Updated 2 years ago
- Training code of waypoint predictor in Discrete-to-Continuous VLN.☆17Updated 7 months ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆15Updated last year
- ☆26Updated last year
- Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation☆17Updated 2 years ago
- Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation☆15Updated 6 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆41Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆34Updated last year
- Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments☆25Updated 11 months ago
- ☆22Updated 2 years ago
- Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation☆45Updated 3 years ago
- Implementation (R2R part) for the paper "Iterative Vision-and-Language Navigation"☆13Updated 7 months ago
- Ideas and thoughts about the fascinating Vision-and-Language Navigation☆155Updated last year
- Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)☆52Updated 2 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆87Updated last year
- Code for MM 22 "Target-Driven Structured Transformer Planner for Vision-Language Navigation"☆14Updated 2 years ago
- Baseline for REVERIE-Challenge using HOP☆9Updated 2 years ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆17Updated last year