GT-RIPL / robo-vln
Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
☆78Updated 10 months ago
Alternatives and similar repositories for robo-vln:
Users that are interested in robo-vln are comparing it to the libraries listed below
- Code for reproducing the results of NeurIPS 2020 paper "MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation”☆50Updated 4 years ago
- ☆49Updated 3 years ago
- Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation☆175Updated 2 years ago
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…☆118Updated last year
- Habitat-Web is a web application to collect human demonstrations for embodied tasks on Amazon Mechanical Turk (AMT) using the Habitat sim…☆54Updated 2 years ago
- Official Implementation of IVLN-CE: Iterative Vision-and-Language Navigation in Continuous Environments☆31Updated last year
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆55Updated 4 months ago
- Code for sim-to-real transfer of a pretrained Vision-and-Language Navigation (VLN) agent to a robot using ROS.☆41Updated 4 years ago
- ☆33Updated last year
- Codebase for the Airbert paper☆45Updated 2 years ago
- ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings. NeurIPS 2022☆73Updated 2 years ago
- Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation☆45Updated 3 years ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON)☆18Updated 3 years ago
- PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning. CVPR 2022 (Oral).☆98Updated 2 years ago
- Official implementation of the NRNS paper☆36Updated 2 years ago
- Official codebase for EmbCLIP☆125Updated last year
- 🔀 Visual Room Rearrangement☆113Updated last year
- Resources for Auxiliary Tasks and Exploration Enable ObjectNav☆40Updated 3 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆90Updated last year
- ☆80Updated 3 years ago
- Code for training embodied agents using imitation learning at scale in Habitat-Lab☆40Updated 3 weeks ago
- [CVPR 2023] CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation☆128Updated last year
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"☆38Updated 3 years ago
- Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).☆121Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆41Updated 2 years ago
- ☆44Updated 3 years ago
- Code for training embodied agents using IL and RL finetuning at scale for ObjectNav☆70Updated 3 weeks ago
- REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments☆125Updated last year
- Pushing it out of the Way: Interactive Visual Navigation☆37Updated last year
- ☆34Updated 3 years ago