raphael-sch / VELMA
VELMA agent for VLN in Street View
☆16Updated last year
Alternatives and similar repositories for VELMA:
Users that are interested in VELMA are comparing it to the libraries listed below
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆25Updated 9 months ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆149Updated last year
- Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" Paper//arxiv.org/abs/2310.07968 …☆27Updated 8 months ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning"☆39Updated 10 months ago
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆38Updated 6 months ago
- Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).☆110Updated last year
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆52Updated 4 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆50Updated 2 months ago
- ☆33Updated last year
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆43Updated 7 months ago
- Official Implementation of ReALFRED (ECCV'24)☆37Updated 4 months ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆15Updated 2 years ago
- Code for MM 22 "Target-Driven Structured Transformer Planner for Vision-Language Navigation"☆15Updated 2 years ago
- Training code of waypoint predictor in Discrete-to-Continuous VLN.☆18Updated 11 months ago
- ☆47Updated 2 years ago
- ☆75Updated 7 months ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆31Updated 2 years ago
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆39Updated last year
- official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"☆31Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆22Updated 2 months ago
- Fast-Slow Test-time Adaptation for Online Vision-and-Language Navigation☆27Updated 4 months ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆28Updated last year
- [ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"☆14Updated 2 years ago
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…☆103Updated last year
- Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"☆29Updated last year
- ☆14Updated last year
- ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings. NeurIPS 2022☆67Updated 2 years ago
- Code for ORAR Agent for Vision and Language Navigation on Touchdown and map2seq☆14Updated last year
- ☆10Updated last year
- Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)☆22Updated this week