zhaoc5 / Grounding-REVERIE-ChallengeView external linksLinks
Official REVERIE Grounding Model of REVERIE Challenge @ CSIG 2022
☆19Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for Grounding-REVERIE-Challenge
Users that are interested in Grounding-REVERIE-Challenge are comparing it to the libraries listed below
Sorting:
- Baseline for REVERIE-Challenge using HOP☆10Jul 4, 2022Updated 3 years ago
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆14Jun 6, 2024Updated last year
- Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments☆35Dec 16, 2023Updated 2 years ago
- Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation☆56Oct 26, 2021Updated 4 years ago
- ☆33Aug 19, 2023Updated 2 years ago
- ☆23Dec 9, 2021Updated 4 years ago
- Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments☆11Nov 29, 2021Updated 4 years ago
- A human-annotated, fine-grained dataset for Vision-and-Language Navigation☆12Jan 20, 2022Updated 4 years ago
- Official implementation of the NRNS paper☆36Jun 13, 2022Updated 3 years ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆314Nov 7, 2023Updated 2 years ago
- Matterport and Unreal Engine Extension for Omniverse Isaac Sim☆20May 9, 2024Updated last year
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆26May 22, 2024Updated last year
- Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"☆30Aug 21, 2023Updated 2 years ago
- REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments☆148Feb 7, 2026Updated last week
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Apr 14, 2025Updated 10 months ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆255Jun 27, 2023Updated 2 years ago
- Codebase for the Airbert paper☆47Mar 20, 2023Updated 2 years ago
- ☆14Sep 21, 2022Updated 3 years ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆16Apr 13, 2023Updated 2 years ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆20Jul 24, 2023Updated 2 years ago
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…☆144Oct 31, 2023Updated 2 years ago
- ☆24Oct 8, 2023Updated 2 years ago
- Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).☆143Jun 14, 2023Updated 2 years ago
- code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"☆23Mar 23, 2021Updated 4 years ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆20Oct 21, 2023Updated 2 years ago
- [ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆35Dec 17, 2025Updated 2 months ago
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)☆27Jul 30, 2023Updated 2 years ago
- ☆55Apr 1, 2022Updated 3 years ago
- PyTorch implementation of "Vision-Dialog Navigation by Exploring Cross-modal Memory", CVPR 2020.☆19Nov 22, 2022Updated 3 years ago
- Training code of waypoint predictor in Discrete-to-Continuous VLN.☆27Mar 25, 2024Updated last year
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆46Jun 2, 2025Updated 8 months ago
- This is the official project webpage of BHSD (MLMI 2023).☆25Nov 3, 2025Updated 3 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆76Dec 26, 2025Updated last month
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆66Dec 27, 2024Updated last year
- [SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…☆32Nov 22, 2025Updated 2 months ago
- ☆84Apr 12, 2022Updated 3 years ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆45Jul 14, 2025Updated 7 months ago