Sha-Lab / babywalk

PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps"

☆42

Alternatives and similar repositories for babywalk

Users who are interested in babywalk are comparing it to the libraries listed below.

chihyaoma / selfmonitoring-agent
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
☆122 Updated 2 years ago
airsplay / R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆143 Updated 4 years ago
mmurray / cvdn
Cooperative Vision-and-Dialog Navigation
☆71 Updated 3 years ago
ronghanghu / speaker_follower
Code release for Fried et al., "Speaker-Follower Models for Vision-and-Language Navigation", NeurIPS 2018.
☆138 Updated 3 years ago
Kelym / FAST
Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"
☆62 Updated 6 years ago
arjunmajum / vln-bert
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
☆58 Updated 3 years ago
lil-lab / ciff
Cornell Instruction Following Framework
☆34 Updated 4 years ago
weituo12321 / PREVALENT
Large-scale pretraining for navigation tasks
☆93 Updated 2 years ago
zzxslp / XL-VLN
Dataset for Bilingual VLN
☆11 Updated 5 years ago
google-research / valan
Vision and Language Agent Navigation
☆82 Updated 4 years ago
zhangybzbo / EnvBiasVLN
Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"
☆16 Updated 5 years ago
HanqingWangAI / Active_VLN
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
☆43 Updated 3 years ago
lil-lab / touchdown
Cornell Touchdown natural language navigation and spatial reasoning dataset.
☆104 Updated 5 years ago
YicongHong / Fine-Grained-R2R
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
☆50 Updated 4 years ago
VegB / VLN-Transformer
Implementation of "Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation"
☆26 Updated 4 years ago
danielgordon10 / thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆126 Updated 5 years ago
ZhuFengdaaa / MG-AuxRN
Code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"
☆23 Updated 4 years ago
alexpashevich / E.T.
Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…
☆93 Updated 2 years ago
YicongHong / Entity-Graph-VLN
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
☆46 Updated 4 years ago
gistvision / moca
Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…
☆40 Updated last year
chihyaoma / regretful-agent
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
☆125 Updated 2 years ago
daqingliu / awesome-vln
A curated list of research papers in Vision-Language Navigation (VLN)
☆231 Updated last year
YuankaiQi / REVERIE
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
☆142 Updated 2 years ago
ccvl / clevr-refplus-dataset-gen
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
☆26 Updated 3 years ago
satwikkottur / clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆49 Updated 5 years ago
594zyc / HiTUT
Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni…
☆25 Updated 4 years ago
YicongHong / Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
☆198 Updated 3 years ago
yuleiniu / rva
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64 Updated 2 years ago
HanqingWangAI / SSM-VLN
Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"
☆42 Updated 4 years ago
lichengunc / refer-parser2
Referring Expression Parser
☆27 Updated 7 years ago