YicongHong/Thinking-VLN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YicongHong/Thinking-VLN)

YicongHong / Thinking-VLN

Ideas and thoughts about the fascinating Vision-and-Language Navigation

☆305

Alternatives and similar repositories for Thinking-VLN

Users that are interested in Thinking-VLN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YicongHong / Recurrent-VLN-BERT
View on GitHub
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
☆209Aug 13, 2022Updated 3 years ago
ChanganVR / awesome-embodied-vision
View on GitHub
Reading list for research topics in embodied vision
☆705Jun 13, 2025Updated last year
YicongHong / Discrete-Continuous-VLN
View on GitHub
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…
☆156Oct 31, 2023Updated 2 years ago
cshizhe / VLN-DUET
View on GitHub
Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).
☆284Jun 27, 2023Updated 3 years ago
UCSB-AI / awesome-vision-language-navigation
View on GitHub
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future…
☆600May 2, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
arjunmajum / vln-bert
View on GitHub
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
☆59Oct 7, 2022Updated 3 years ago
jialuli-luka / VLN-SIG
View on GitHub
☆35Aug 19, 2023Updated 2 years ago
jialuli-luka / EnvEdit
View on GitHub
Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)
☆30Aug 2, 2022Updated 3 years ago
MarSaKi / ETPNav
View on GitHub
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
☆478Apr 27, 2026Updated 2 months ago
GengzeZhou / NavGPT
View on GitHub
[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
☆346Nov 7, 2023Updated 2 years ago
MarSaKi / VLN-BEVBert
View on GitHub
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
☆260Apr 27, 2026Updated 2 months ago
batra-mlp-lab / vln-sim2real
View on GitHub
Code for sim-to-real transfer of a pretrained Vision-and-Language Navigation (VLN) agent to a robot using ROS.
☆46Nov 10, 2020Updated 5 years ago
jacobkrantz / VLN-CE
View on GitHub
Vision-and-Language Navigation in Continuous Environments using Habitat
☆841Jan 7, 2025Updated last year
daqingliu / awesome-vln
View on GitHub
A curated list of research papers in Vision-Language Navigation (VLN)
☆238Apr 17, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
YuankaiQi / REVERIE
View on GitHub
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
☆158May 15, 2026Updated 2 months ago
YicongHong / Entity-Graph-VLN
View on GitHub
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
☆47Oct 31, 2021Updated 4 years ago
wz0919 / ScaleVLN
View on GitHub
[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation
☆225Jul 2, 2025Updated last year
PeihaoChen / WS-MGMap
View on GitHub
Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…
☆35Apr 23, 2023Updated 3 years ago
GengzeZhou / NavGPT-2
View on GitHub
[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
☆245Apr 3, 2026Updated 3 months ago
MrZihan / Sim2Real-VLN-3DFF
View on GitHub
Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
☆80Dec 26, 2025Updated 6 months ago
YanyuanQiao / HOP-VLN
View on GitHub
Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"
☆31Aug 21, 2023Updated 2 years ago
YicongHong / Fine-Grained-R2R
View on GitHub
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
☆59Oct 26, 2021Updated 4 years ago
peteanderson80 / Matterport3DSimulator
View on GitHub
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
☆707Jul 12, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jialuli-luka / PanoGen
View on GitHub
Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
☆83May 31, 2023Updated 3 years ago
chihyaoma / selfmonitoring-agent
View on GitHub
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
☆123Oct 3, 2023Updated 2 years ago
airsplay / R2R-EnvDrop
View on GitHub
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆146Oct 23, 2021Updated 4 years ago
YicongHong / Ego2Map-NaViT
View on GitHub
Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)
☆28Jul 30, 2023Updated 2 years ago
google-research-datasets / RxR
View on GitHub
Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It c…
☆189Jul 26, 2023Updated 2 years ago
Hoyyyaard / NavGPT
View on GitHub
☆10Nov 16, 2023Updated 2 years ago
jzhzhang / NaVid-VLN-CE
View on GitHub
[RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid
☆437Oct 15, 2025Updated 9 months ago
YuankaiQi / ORIST
View on GitHub
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
☆16Feb 7, 2022Updated 4 years ago
zehao-wang / LAD
View on GitHub
Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].
☆16Apr 13, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ggeorgak11 / CM2
View on GitHub
☆59Apr 1, 2022Updated 4 years ago
jacobkrantz / Sim2Sim-VLNCE
View on GitHub
Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
☆35Dec 16, 2023Updated 2 years ago
wz0919 / VLN-SRDF
View on GitHub
Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
☆35Jun 10, 2025Updated last year
cshizhe / VLN-HAMT
View on GitHub
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
☆147Jun 14, 2023Updated 3 years ago
IMNearth / Curriculum-Learning-For-VLN
View on GitHub
Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"
☆16Dec 13, 2022Updated 3 years ago
zd11024 / NaviLLM
View on GitHub
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
☆238Jun 18, 2024Updated 2 years ago
zhangyuejoslin / VLN-Survey-with-Foundation-Models
View on GitHub
[TMLR 2024] repository for VLN with foundation models
☆294Apr 17, 2026Updated 3 months ago