wz0919 / ScaleVLN
[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation
☆142Updated last week
Related projects: ⓘ
- [ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"☆181Updated 10 months ago
- [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"☆77Updated last year
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆191Updated last month
- ☆25Updated 11 months ago
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)☆12Updated 8 months ago
- The Pytorch implementation of Grounding 3D Object Affordance from 2D Interactios in Images.☆112Updated 10 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆115Updated 11 months ago
- ☆24Updated last year
- Official REVERIE Grounding Model of REVERIE Challenge @ CSIG 2022☆19Updated last year
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆32Updated 2 years ago
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆22Updated 3 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆43Updated 2 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆38Updated last year
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆34Updated last month
- ☆85Updated 2 months ago
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆27Updated last week
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆16Updated 10 months ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆27Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆72Updated last year
- ☆10Updated last month
- ☆99Updated last year
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding (AAAI'23).☆16Updated last year
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)☆24Updated last year
- Code&Data for Grounded 3D-LLM with Referent Tokens☆74Updated 2 months ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆106Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆32Updated last year
- 😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.☆56Updated 2 weeks ago
- Training code of waypoint predictor in Discrete-to-Continuous VLN.☆15Updated 5 months ago
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…☆45Updated this week
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆32Updated 2 months ago