InternRobotics' open platform for building generalized navigation foundation models.
☆732Mar 10, 2026Updated last week
Alternatives and similar repositories for InternNav
Users that are interested in InternNav are comparing it to the libraries listed below
Sorting:
- Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"☆551Jan 12, 2026Updated 2 months ago
- [ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"☆426Nov 2, 2025Updated 4 months ago
- [RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"☆551Aug 20, 2025Updated 7 months ago
- [CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"☆357Nov 25, 2025Updated 3 months ago
- [RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid☆381Oct 15, 2025Updated 5 months ago
- The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)☆700Nov 12, 2025Updated 4 months ago
- InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.☆48Sep 18, 2025Updated 6 months ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆47Jul 14, 2025Updated 8 months ago
- Vision-and-Language Navigation in Continuous Environments using Habitat☆742Jan 7, 2025Updated last year
- [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling☆91Jan 11, 2026Updated 2 months ago
- An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.☆169Oct 15, 2025Updated 5 months ago
- [RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.☆248Dec 15, 2025Updated 3 months ago
- Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)☆232Aug 20, 2025Updated 7 months ago
- A simulation platform for versatile Embodied AI research and developments.☆1,219Sep 4, 2025Updated 6 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆231Oct 17, 2025Updated 5 months ago
- Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.☆1,158Sep 15, 2024Updated last year
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆73Sep 29, 2025Updated 5 months ago
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆427Apr 5, 2025Updated 11 months ago
- Code for OctoNav-R1☆65Mar 11, 2026Updated last week
- [ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆78Mar 13, 2026Updated last week
- Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", …☆132Oct 30, 2024Updated last year
- [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation☆311Sep 16, 2025Updated 6 months ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆237Sep 20, 2024Updated last year
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆197Nov 11, 2024Updated last year
- Vision-Language Navigation Benchmark in Isaac Lab☆300Aug 28, 2025Updated 6 months ago
- Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation (CVPR2024)☆56Mar 27, 2025Updated 11 months ago
- ☆246Aug 6, 2025Updated 7 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆230Jun 18, 2024Updated last year
- ☆85Dec 29, 2025Updated 2 months ago
- ☆197Mar 29, 2025Updated 11 months ago
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆14Jun 6, 2024Updated last year
- End-to-End Navigation with VLMs☆118Feb 26, 2026Updated 3 weeks ago
- [ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation☆212Jul 2, 2025Updated 8 months ago
- [RAL-25] An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with ROS support.☆161Jan 1, 2026Updated 2 months ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025)☆132Jun 4, 2025Updated 9 months ago
- Low-level locomotion policy training in Isaac Lab☆413Mar 7, 2025Updated last year
- ☆64Mar 10, 2026Updated last week
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation☆150Oct 24, 2025Updated 4 months ago
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆500Mar 2, 2026Updated 2 weeks ago