ai4ce / CityWalker

[CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

☆118

Alternatives and similar repositories for CityWalker

Users who are interested in CityWalker are comparing it to the repositories listed below.

iminolee / Awesome-Vision-and-Language-Navigation
A curated list of awesome Vision-and-Language Navigation (VLN) resources (continually updated)
☆90 Updated 4 months ago
bagh2178 / UniGoal
[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
☆180 Updated last month
B0B8K1ng / WMNavigation
[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
☆96 Updated last week
AnjieCheng / NaVILA
[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"
☆135 Updated last week
XiaohanLei / GaussNav
PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation
☆142 Updated 8 months ago
MrZihan / NavRAG
Official implementation of "NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM" (ACL'25 …
☆41 Updated 4 months ago
honghd16 / GSA-VLN
Official repository of General Scene Adaptation for Vision-and-Language Navigation (ICLR'2025)
☆46 Updated 3 months ago
HaochenZ11 / VLA-3D
☆67 Updated 6 months ago
linukc / BeyondBareQueries
☆28 Updated last month
jzhzhang / Uni-NaVid
[RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
☆80 Updated last month
BJHYZJ / DovSG
[RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
☆89 Updated 2 months ago
BIT-DYN / OpenGraph
[RAL 2024] OpenGraphs: Open-Vocabulary Hierarchical 3D Scene Graphs in Large-Scale Outdoor Environments
☆111 Updated 3 months ago
zhangyuejoslin / VLN-Survey-with-Foundation-Models
[TMLR 2024] repository for VLN with foundation models
☆134 Updated 3 months ago
Jirl-upenn / VLMnav
End-to-End Navigation with VLMs
☆91 Updated 3 months ago
changhaonan / OVSG
[CoRL2023] Open-Vocabulary Scene-Graph
☆68 Updated last year
GeLuzhou / Dynamic-GSG
Dynamic 3D Gaussian Scene Graphs for Environment Adaptation
☆44 Updated last month
MrZihan / Sim2Real-VLN-3DFF
Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
☆67 Updated 4 months ago
wsakobe / TrackVLA
Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"
☆120 Updated last week
MrZihan / g3D-LF
Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).
☆29 Updated this week
roomtour3d / roomtour3d-NaviLLM
[CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation
☆56 Updated 3 months ago
buaa-colalab / OctoNav-R1
Code for OctoNav-R1
☆44 Updated 3 weeks ago
OpenRobotLab / VLM-Grounder
[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
☆108 Updated last month
ai4ce / MSG
[NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)
☆116 Updated 6 months ago
bagh2178 / SG-Nav
[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
☆221 Updated 4 months ago
SHAILAB-IPEC / OpenFly-Platform
☆148 Updated 3 weeks ago
UMass-Embodied-AGI / 3D-Mem
[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"
☆149 Updated last month
iris0329 / SeeGround
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
☆152 Updated 2 months ago
sx-zhang / SGM
Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)
☆47 Updated 3 months ago
facebookresearch / nwm
Official code for the CVPR 2025 paper "Navigation World Models".
☆297 Updated last week
Ram81 / goat-bench
☆102 Updated last year