Jirl-upenn/VLMnav

End-to-End Navigation with VLMs

☆116

Alternatives and similar repositories for VLMnav

Users who are interested in VLMnav are comparing it to the libraries listed below.

B0B8K1ng / WMNavigation
[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
☆146 · Oct 24, 2025 · Updated 4 months ago
naokiyokoyama / ovon
Open Vocabulary Object Navigation
☆117 · May 15, 2025 · Updated 9 months ago
GAMMA-UMD-Outdoor-Navigation / BehAV
BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes (ICRA'25)
☆38 · Oct 3, 2024 · Updated last year
honghd16 / GSA-VLN
Official repository of General Scene Adaptation for Vision-and-Language Navigation (ICLR'2025)
☆65 · Apr 16, 2025 · Updated 10 months ago
LYX0501 / InstructNav
☆193 · Mar 29, 2025 · Updated 11 months ago
sx-zhang / SGM
Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)
☆55 · Mar 27, 2025 · Updated 11 months ago
chen-judge / MapGPT
[ACL 24] The official implementation of MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation.
☆122 · May 3, 2025 · Updated 10 months ago
ybgdgh / VLN-Game
A new zero-shot framework to explore and search for the language descriptive targets in unknown environment based on Large Vision Languag…
☆56 · Nov 28, 2024 · Updated last year
HCPLab-SYSU / LH-VLN
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)
☆228 · Aug 20, 2025 · Updated 6 months ago
Zeying-Gong / ascent
[RAL'26] Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration
☆81 · Jan 11, 2026 · Updated last month
yhanCao / CogNav_ObjNav
The official implementation of CogNav [ICCV 2025]
☆63 · Sep 24, 2025 · Updated 5 months ago
shalexyuan / GAMap
☆31 · Nov 6, 2024 · Updated last year
Ram81 / goat-bench
☆126 · Jul 9, 2024 · Updated last year
jzhzhang / NaVid-VLN-CE
[RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid
☆376 · Oct 15, 2025 · Updated 4 months ago
wzcai99 / Pixel-Navigator
Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", …
☆130 · Oct 30, 2024 · Updated last year
uzh-rpg / foresight-nav
[CVPR Workshop 2025 - OpenSun3D] ForesightNav: Learning Scene Imagination for Efficient Exploration
☆70 · Apr 23, 2025 · Updated 10 months ago
bdaiinstitute / vlfm
Code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)
☆688 · Nov 12, 2025 · Updated 3 months ago
peiqi-liu / stretch_ai
☆14 · May 21, 2025 · Updated 9 months ago
bagh2178 / UniGoal
[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
☆304 · Sep 16, 2025 · Updated 5 months ago
yxKryptonite / OpenFMNav
Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models
☆59 · Sep 17, 2024 · Updated last year
MrZihan / Dynam3D
Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)
☆76 · Dec 22, 2025 · Updated 2 months ago
Stanford-ILIAD / explore-eqa
Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"
☆76 · Jul 5, 2024 · Updated last year
Robotics-STAR-Lab / ApexNav
[RA-L'25] A Reliable and Efficient Framework for Zero-Shot Object Navigation
☆305 · Feb 10, 2026 · Updated 3 weeks ago
MrZihan / HNR-VLN
Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…
☆104 · Apr 2, 2025 · Updated 11 months ago
zd11024 / NaviLLM
[CVPR 2024] The code for the paper 'Towards Learning a Generalist Model for Embodied Navigation'
☆229 · Jun 18, 2024 · Updated last year
ziyan-xiaoyu / SpatialMQA
☆19 · May 28, 2025 · Updated 9 months ago
chen-judge / AO-Planner
[AAAI 25] The official implementation of Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
☆46 · Mar 2, 2025 · Updated last year
vlmaps / vlmaps
[ICRA2023] Implementation of Visual Language Maps for Robot Navigation
☆649 · Jul 9, 2024 · Updated last year
PKU-HMI-Lab / AC-DiT
AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation
☆31 · Feb 23, 2026 · Updated last week
Jbwasse2 / XGX
https://xgxvisnav.github.io/
☆22 · Dec 22, 2023 · Updated 2 years ago
XiaohanLei / GaussNav
PyTorch implementation of the paper GaussNav: Gaussian Splatting for Visual Navigation
☆194 · Nov 11, 2024 · Updated last year
bagh2178 / SG-Nav
[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
☆319 · Sep 16, 2025 · Updated 5 months ago
KTH-RPL / OneMap
[ICRA'25] One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation
☆135 · Oct 28, 2025 · Updated 4 months ago
ybgdgh / L3MVN
Leveraging Large Language Models for Visual Target Navigation
☆157 · Oct 24, 2023 · Updated 2 years ago
jackyzengl / THUD_Dataset_Overview
THUD Dataset Overview
☆26 · May 22, 2024 · Updated last year
jacobkrantz / VLN-CE
Vision-and-Language Navigation in Continuous Environments using Habitat
☆729 · Jan 7, 2025 · Updated last year
AdityaNG / general-navigation
General Navigation Models based on GNM, ViNT, and NoMaD, packaged as a PyTorch repo for quick and easy deployment
☆14 · Nov 18, 2024 · Updated last year
InternRobotics / InternNav
InternRobotics' open platform for building generalized navigation foundation models.
☆699 · Updated this week
GengzeZhou / NavGPT-2
[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
☆236 · Sep 20, 2024 · Updated last year