raphael-sch / VELMA
VELMA agent for VLN in Street View
☆13 · Updated 11 months ago
Related projects:
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models ☆61 · Updated 3 weeks ago
- ☆11 · Updated 10 months ago
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22). ☆32 · Updated last year
- Official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation" ☆27 · Updated 7 months ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" ☆21 · Updated 5 months ago
- Official implementation of Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation (CVPR'22 Oral). ☆106 · Updated last year
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression" ☆22 · Updated 3 months ago
- Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" Paper: arxiv.org/abs/2310.07968 … ☆18 · Updated 3 months ago
- ☆24 · Updated last year
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆102 · Updated 3 months ago
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023) ☆24 · Updated last year
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation ☆18 · Updated last year
- ☆53 · Updated 2 months ago
- PyTorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022) ☆32 · Updated 2 years ago
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23) ☆34 · Updated last month
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning ☆115 · Updated 11 months ago
- Code for MM 22 "Target-Driven Structured Transformer Planner for Vision-Language Navigation" ☆14 · Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting ☆17 · Updated 2 months ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation" ☆15 · Updated last year
- ☆36 · Updated 5 months ago
- Official PyTorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati… ☆27 · Updated last year
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N… ☆86 · Updated 10 months ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding (AAAI'23). ☆16 · Updated last year
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld ☆37 · Updated 2 weeks ago
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation" ☆20 · Updated last year
- Open Vocabulary Object Navigation ☆15 · Updated 5 months ago
- Official Implementation of ReALFRED (ECCV'24) ☆16 · Updated last month