intelligolabs / CoINLinks
[ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues"
☆24Updated 2 months ago
Alternatives and similar repositories for CoIN
Users that are interested in CoIN are comparing it to the libraries listed below
Sorting:
- Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" Paper//arxiv.org/abs/2310.07968 …☆31Updated last year
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆255Updated 2 years ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆229Updated last year
- Official Implementation of ReALFRED (ECCV'24)☆44Updated last year
- ☆55Updated 3 years ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆314Updated 2 years ago
- official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"☆44Updated 2 years ago
- Training code of waypoint predictor in Discrete-to-Continuous VLN.☆27Updated last year
- ☆123Updated 2 years ago
- [NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"☆27Updated 2 months ago
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆26Updated last year
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆74Updated last year
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025)☆128Updated 8 months ago
- Ideas and thoughts about the fascinating Vision-and-Language Navigation☆292Updated 2 years ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆238Updated last year
- ☆194Updated 10 months ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆33Updated 2 years ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆381Updated 3 months ago
- Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).☆142Updated 2 years ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆364Updated 10 months ago
- A collection of vision-language-action model post-training methods.☆125Updated last week
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆262Updated 3 months ago
- ☆124Updated last year
- SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World☆146Updated last year
- [AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments"☆69Updated 3 months ago
- Vision-and-Language Navigation in Continuous Environments using Habitat☆718Updated last year
- The project repository for paper EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents: https://arxiv.org/abs…☆62Updated last year
- Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation☆62Updated last year
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆278Updated 11 months ago
- [RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid☆368Updated 3 months ago