Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)
☆14Jan 8, 2024Updated 2 years ago
Alternatives and similar repositories for FDA
Users that are interested in FDA are comparing it to the libraries listed below
Sorting:
- ☆10Nov 16, 2023Updated 2 years ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆16Apr 13, 2023Updated 2 years ago
- A human-annotated, fine-grained dataset for Vision-and-Language Navigation☆12Jan 20, 2022Updated 4 years ago
- [ACM MM 2022] Target-Driven Structured Transformer Planner for Vision-Language Navigation☆17Nov 1, 2022Updated 3 years ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆14Dec 13, 2022Updated 3 years ago
- ☆15Jun 14, 2025Updated 8 months ago
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆29Dec 16, 2024Updated last year
- Official implementation of NeurIPS 2022 paper "Learning Active Camera for Multi-Object Navigation"☆10Apr 23, 2023Updated 2 years ago
- ☆11Jul 16, 2024Updated last year
- Baseline for REVERIE-Challenge using HOP☆10Jul 4, 2022Updated 3 years ago
- Official implementation of the NRNS paper☆36Jun 13, 2022Updated 3 years ago
- Awesome habitat top down map work 🤩☆35Apr 7, 2024Updated last year
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆33Apr 23, 2023Updated 2 years ago
- ☆17Dec 11, 2024Updated last year
- Official Repository for the ACM MM 2024 paper "Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments"☆15May 16, 2025Updated 9 months ago
- ☆18Mar 12, 2025Updated 11 months ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆14Feb 7, 2025Updated last year
- Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)☆16Jun 7, 2024Updated last year
- Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation. CVPR 2022☆35Oct 27, 2022Updated 3 years ago
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.☆15Jun 4, 2024Updated last year
- ☆33Aug 19, 2023Updated 2 years ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆75Dec 26, 2025Updated 2 months ago
- ☆17Oct 25, 2022Updated 3 years ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆45Aug 6, 2024Updated last year
- ☆58Oct 28, 2025Updated 4 months ago
- [ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language☆17Dec 3, 2024Updated last year
- ☆21Oct 19, 2024Updated last year
- Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation☆16Feb 7, 2022Updated 4 years ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆49Dec 20, 2024Updated last year
- ☆23Jul 10, 2025Updated 7 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…☆146Oct 31, 2023Updated 2 years ago
- ☆24Oct 8, 2023Updated 2 years ago
- ☆20Mar 11, 2022Updated 3 years ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆236Sep 20, 2024Updated last year
- ☆23Mar 9, 2023Updated 2 years ago
- A python3 library for evaluating caption's BLEU, Meteor, CIDEr, SPICE,ROUGE_L,WMD score. Fork from https://github.com/ruotianluo/coco-cap…☆22Nov 25, 2020Updated 5 years ago