VELMA agent for VLN in Street View
☆30 · Sep 29, 2023 · Updated 2 years ago
Alternatives and similar repositories for VELMA
Users interested in VELMA are comparing it to the libraries listed below.
- Implementation of "Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation" ☆27 · Mar 4, 2021 · Updated 5 years ago
- ☆14 · Dec 16, 2021 · Updated 4 years ago
- Cornell Touchdown natural language navigation and spatial reasoning dataset. ☆109 · Sep 5, 2020 · Updated 5 years ago
- Code for ORAR Agent for Vision and Language Navigation on Touchdown and map2seq ☆20 · Nov 3, 2023 · Updated 2 years ago
- ☆21 · Oct 10, 2023 · Updated 2 years ago
- ☆18 · Jul 8, 2025 · Updated 8 months ago
- ☆13 · Dec 8, 2022 · Updated 3 years ago
- [AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments" ☆68 · Nov 2, 2025 · Updated 4 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆62 · Apr 11, 2024 · Updated last year
- Official Repository for the ACM MM 2024 paper "Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments" ☆15 · May 16, 2025 · Updated 10 months ago
- ☆28 · Feb 24, 2026 · Updated 3 weeks ago
- https://xgxvisnav.github.io/ ☆22 · Dec 22, 2023 · Updated 2 years ago
- ☆37 · Apr 2, 2024 · Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting ☆29 · Dec 16, 2024 · Updated last year
- Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models". ☆16 · May 15, 2023 · Updated 2 years ago
- Code and models for the CVPR 2017 paper "DeepNav: Learning to Navigate Large Cities" ☆13 · Feb 16, 2020 · Updated 6 years ago
- Official implementation of the NRNS paper ☆37 · Jun 13, 2022 · Updated 3 years ago
- Code of the paper "EvolveNav: Empowering LLM-Based Vision-Language Navigation via Self-Improving Embodied Reasoning" ☆28 · Oct 14, 2025 · Updated 5 months ago
- ☆13 · Oct 15, 2025 · Updated 5 months ago
- Detail-Sensitive Panoramic Annular Semantic Segmentation ☆12 · May 19, 2022 · Updated 3 years ago
- VHTest ☆16 · Oct 31, 2024 · Updated last year
- ☆10 · Nov 16, 2023 · Updated 2 years ago
- ☆13 · Mar 13, 2023 · Updated 3 years ago
- Codebase for "Towards Generalizable Safety in Crowd Navigation via Conformal Uncertainty Handling" [CoRL 2025]. ☆29 · Jan 9, 2026 · Updated 2 months ago
- Code for <Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing> in WACV 2023 ☆12 · Jan 26, 2023 · Updated 3 years ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models ☆322 · Nov 7, 2023 · Updated 2 years ago
- ☆20 · Nov 13, 2023 · Updated 2 years ago
- SP-SLAM: Surface-Point Simultaneous Localization and Mapping ☆24 · Jan 22, 2021 · Updated 5 years ago
- 🔥 [NeurIPS 2024] A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embed… ☆13 · Jun 21, 2025 · Updated 9 months ago
- This repo contains all the codes for SEScore implementation ☆15 · Mar 3, 2025 · Updated last year
- ☆12 · Jan 11, 2023 · Updated 3 years ago
- FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 2025. ☆37 · Dec 18, 2024 · Updated last year
- Focused on the safety and security of Embodied AI ☆99 · Dec 19, 2025 · Updated 3 months ago
- Human-centered Delivery Benchmark ☆20 · Jul 24, 2024 · Updated last year
- ☆10 · Oct 1, 2019 · Updated 6 years ago
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023) ☆14 · Jan 8, 2024 · Updated 2 years ago
- Contains scripts for the PSI competition. ☆10 · Dec 11, 2023 · Updated 2 years ago
- ☆10 · Aug 29, 2019 · Updated 6 years ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral). ☆260 · Jun 27, 2023 · Updated 2 years ago