chihyaoma / selfmonitoring-agentLinks

PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation

☆122

Alternatives and similar repositories for selfmonitoring-agent

Users that are interested in selfmonitoring-agent are comparing it to the libraries listed below

Sorting:

Kelym / FAST
Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"
☆61Updated 5 years ago
ronghanghu / speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
☆133Updated 2 years ago
airsplay / R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆132Updated 3 years ago
Sha-Lab / babywalk
PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"
☆42Updated 3 years ago
mmurray / cvdn
Cooperative Vision-and-Dialog Navigation
☆71Updated 2 years ago
chihyaoma / regretful-agent
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
☆126Updated last year
danielgordon10 / thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆124Updated 5 years ago
google-research / valan
Vision and Language Agent Navigation
☆81Updated 4 years ago
weituo12321 / PREVALENT
large scale pretrain for navigation task
☆92Updated 2 years ago
HanqingWangAI / Active_VLN
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
☆44Updated 3 years ago
lil-lab / touchdown
Cornell Touchdown natural language navigation and spatial reasoning dataset.
☆102Updated 4 years ago
YicongHong / Fine-Grained-R2R
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
☆49Updated 3 years ago
arjunmajum / vln-bert
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
☆56Updated 2 years ago
VegB / VLN-Transformer
Implementation of "Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation"
☆25Updated 4 years ago
alexpashevich / E.T.
Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…
☆91Updated 2 years ago
lil-lab / ciff
Cornell Instruction Following Framework
☆34Updated 3 years ago
allenai / savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
☆189Updated 5 years ago
ZhuFengdaaa / MG-AuxRN
code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"
☆23Updated 4 years ago
zilongzheng / visdial-gnn
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Updated 4 years ago
YuankaiQi / REVERIE
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
☆135Updated last year
YicongHong / Entity-Graph-VLN
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
☆45Updated 3 years ago
sibeiyang / sgmn
Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.
☆116Updated 4 years ago
gistvision / moca
Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…
☆38Updated last year
satwikkottur / clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆48Updated 5 years ago
daqingliu / awesome-vln
A curated list of research papers in Vision-Language Navigation (VLN)
☆218Updated last year
zhangybzbo / EnvBiasVLN
Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"
☆17Updated 5 years ago
facebookresearch / EmbodiedQA
Train embodied agents that can answer questions in environments
☆309Updated 2 years ago
ronghanghu / snmn
Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018
☆71Updated 5 years ago
aimagelab / DynamicConv-agent
PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters
☆20Updated 2 years ago
YicongHong / Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
☆182Updated 2 years ago