hekj / FDALinks
Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)
☆14Updated last year
Alternatives and similar repositories for FDA
Users that are interested in FDA are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆17Updated 2 years ago
- ☆10Updated 2 years ago
- [ACM MM 2022] Target-Driven Structured Transformer Planner for Vision-Language Navigation☆15Updated 2 years ago
- ☆10Updated last year
- Embodied Question Answering (EQA) benchmark and method☆20Updated 2 months ago
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆23Updated 5 months ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆31Updated 2 years ago
- An Examination of the Compositionality of Large Generative Vision-Language Models☆19Updated last year
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆84Updated 11 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆21Updated last year
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆27Updated last year
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆43Updated 10 months ago
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆23Updated 5 months ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆15Updated 2 years ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON)☆18Updated 3 years ago
- ☆37Updated last year
- Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel☆25Updated 5 months ago
- ☆18Updated 2 years ago
- Baseline for REVERIE-Challenge using HOP☆10Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆39Updated last year
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆31Updated last year
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆75Updated this week
- ☆20Updated 3 years ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024)☆30Updated 3 weeks ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 8 months ago
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆41Updated 2 years ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆47Updated 5 months ago
- [ACL2023] Official code repository for VLN-Trans☆13Updated last year
- Official REVERIE Grounding Model of REVERIE Challenge @ CSIG 2022☆19Updated 2 years ago