End-to-End Navigation with VLMs
☆120Feb 26, 2026Updated last month
Alternatives and similar repositories for VLMnav
Users that are interested in VLMnav are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation☆160Mar 24, 2026Updated 3 weeks ago
- ☆204Mar 29, 2025Updated last year
- Official repository of General Scene Adaptation for Vision-and-Language Navigation (ICLR'2025)☆68Apr 16, 2025Updated last year
- BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes (ICRA'25)☆40Oct 3, 2024Updated last year
- Open Vocabulary Object Navigation☆125May 15, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆137Jul 9, 2024Updated last year
- Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)☆232Aug 20, 2025Updated 7 months ago
- [ACL 24] The official implementation of MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation.☆127May 3, 2025Updated 11 months ago
- Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", …☆131Oct 30, 2024Updated last year
- ☆31Nov 6, 2024Updated last year
- Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)☆58Mar 27, 2025Updated last year
- [RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid☆397Oct 15, 2025Updated 6 months ago
- A new zero-shot framework to explore and search for the language descriptive targets in unknown environment based on Large Vision Languag…☆62Nov 28, 2024Updated last year
- The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)☆725Nov 12, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [AAAI 25] The official implementation of Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation☆47Mar 2, 2025Updated last year
- ☆14May 21, 2025Updated 10 months ago
- General Navigation Models based on GNM, ViNT, NoMaD as a pytorch repo for quick and easy deployment☆14Nov 18, 2024Updated last year
- the official implementation of CogNav [ICCV 2025]☆72Sep 24, 2025Updated 6 months ago
- [CVPR Workshop 2025 - OpenSun3D] ForesightNav: Learning Scene Imagination for Efficient Exploration☆70Apr 23, 2025Updated 11 months ago
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆107Apr 2, 2025Updated last year
- THUD Dataset Overview☆27May 22, 2024Updated last year
- Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models☆60Sep 17, 2024Updated last year
- [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation☆323Sep 16, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation☆320Sep 16, 2025Updated 7 months ago
- [RA-L'25] An Reliable and Efficient Framework for Zero-Shot Object Navigation☆336Apr 3, 2026Updated last week
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆78Jul 5, 2024Updated last year
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆231Jun 18, 2024Updated last year
- [RAL‘26] Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration☆100Jan 11, 2026Updated 3 months ago
- Code for ICCV 2023 paper "Multi-Object Navigation with dynamically learned neural implicit representations"☆14Mar 20, 2024Updated 2 years ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆39Feb 23, 2026Updated last month
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)☆81Dec 22, 2025Updated 3 months ago
- Vision-and-Language Navigation in Continuous Environments using Habitat☆769Jan 7, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆18Mar 12, 2025Updated last year
- [ICRA'25] One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation☆142Oct 28, 2025Updated 5 months ago
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆201Nov 11, 2024Updated last year
- [ICRA2023] Implementation of Visual Language Maps for Robot Navigation☆667Jul 9, 2024Updated last year
- CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation☆41Mar 23, 2026Updated 3 weeks ago
- InternRobotics' open platform for building generalized navigation foundation models.☆787Mar 10, 2026Updated last month
- Leveraging Large Language Models for Visual Target Navigation☆161Oct 24, 2023Updated 2 years ago