Code for CVPR22 paper One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
☆13Jul 27, 2022Updated 3 years ago
Alternatives and similar repositories for M-Track
Users that are interested in M-Track are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆34Apr 23, 2023Updated 3 years ago
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆45Aug 6, 2024Updated last year
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)☆10Jul 17, 2022Updated 3 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts(IJCAI 2024)☆15Oct 16, 2024Updated last year
- ☆33Aug 19, 2023Updated 2 years ago
- Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation☆17Apr 25, 2024Updated 2 years ago
- Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)☆16Jun 7, 2024Updated last year
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated last year
- Prompter for Embodied Instruction Following☆18Nov 30, 2023Updated 2 years ago
- [ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language☆17Dec 3, 2024Updated last year
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆93Jul 11, 2023Updated 2 years ago
- Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)☆59Oct 7, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆16Apr 13, 2023Updated 3 years ago
- Official repository of ICLR 2022 paper FILM: Following Instructions in Language with Modular Methods☆128Apr 9, 2023Updated 3 years ago
- Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments☆35Dec 16, 2023Updated 2 years ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆15Dec 13, 2022Updated 3 years ago
- Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"☆31Aug 21, 2023Updated 2 years ago
- ☆110Feb 11, 2026Updated 2 months ago
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆26May 22, 2024Updated last year
- Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty☆23Dec 11, 2023Updated 2 years ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation☆122Oct 3, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"☆42Apr 13, 2022Updated 4 years ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆30Aug 2, 2022Updated 3 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆29Dec 16, 2024Updated last year
- Fast-Slow Test-time Adaptation for Online Vision-and-Language Navigation☆34Dec 5, 2025Updated 5 months ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks☆511Feb 5, 2026Updated 3 months ago
- Official implementation of WebVLN: Vision-and-Language Navigation on Websites☆35Jan 2, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆44Apr 9, 2022Updated 4 years ago
- Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments☆13Nov 29, 2021Updated 4 years ago
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 4 years ago
- ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing☆31Aug 30, 2021Updated 4 years ago
- ☆38Apr 2, 2024Updated 2 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- Repository for DialFRED.☆45Sep 14, 2023Updated 2 years ago