Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi and Telugu, and 126k navigation following demonstrations. Both annotation types include dense spatiotemporal alignments between the text and the visual per…
☆180Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for RxR
Users that are interested in RxR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation☆204Aug 13, 2022Updated 3 years ago
- Vision-and-Language Navigation in Continuous Environments using Habitat☆793Jan 7, 2025Updated last year
- Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…☆150Oct 31, 2023Updated 2 years ago
- AI Research Platform for Reinforcement Learning from Real Panoramic Images.☆692Jul 12, 2024Updated last year
- Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).☆145Jun 14, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Cooperative Vision-and-Dialog Navigation☆74Nov 22, 2022Updated 3 years ago
- REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments☆155May 15, 2026Updated last week
- [ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation☆217Jul 2, 2025Updated 10 months ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆273Jun 27, 2023Updated 2 years ago
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆459Apr 27, 2026Updated 3 weeks ago
- Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.☆137Nov 22, 2022Updated 3 years ago
- Code for sim-to-real transfer of a pretrained Vision-and-Language Navigation (VLN) agent to a robot using ROS.☆45Nov 10, 2020Updated 5 years ago
- Official Implementation of IVLN-CE: Iterative Vision-and-Language Navigation in Continuous Environments☆38Dec 16, 2023Updated 2 years ago
- large scale pretrain for navigation task☆95Mar 2, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation☆57Oct 26, 2021Updated 4 years ago
- Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments☆35Dec 16, 2023Updated 2 years ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON)☆24Nov 23, 2021Updated 4 years ago
- Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"☆16May 6, 2020Updated 6 years ago
- Ideas and thoughts about the fascinating Vision-and-Language Navigation☆302Jun 28, 2023Updated 2 years ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆30Aug 2, 2022Updated 3 years ago
- PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"☆144Oct 23, 2021Updated 4 years ago
- Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments☆13Nov 29, 2021Updated 4 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆235Jun 18, 2024Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆44Mar 16, 2023Updated 3 years ago
- [ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"☆260Apr 27, 2026Updated 3 weeks ago
- Code and utilities for creating a Vision-and-Language Navigation (VLN) simulator environment from a physical space.☆12Nov 10, 2020Updated 5 years ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆241Apr 3, 2026Updated last month
- Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"☆31Aug 21, 2023Updated 2 years ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆44Apr 9, 2022Updated 4 years ago
- Reading list for research topics in embodied vision☆704Jun 13, 2025Updated 11 months ago
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆108Apr 2, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)☆27Jul 30, 2023Updated 2 years ago
- ☆34Aug 19, 2023Updated 2 years ago
- ☆57Apr 1, 2022Updated 4 years ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆15Dec 13, 2022Updated 3 years ago
- Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)☆59Oct 7, 2022Updated 3 years ago
- A mini-framework for running AI2-Thor with Docker.☆38Apr 26, 2024Updated 2 years ago
- Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation☆82May 31, 2023Updated 2 years ago