danielgordon10 / thor-iqa-cvpr-2018Links

Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"

☆126

Alternatives and similar repositories for thor-iqa-cvpr-2018

Users that are interested in thor-iqa-cvpr-2018 are comparing it to the libraries listed below

Sorting:

chihyaoma / selfmonitoring-agent
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
☆122Updated 2 years ago
Kelym / FAST
Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"
☆61Updated 6 years ago
chihyaoma / regretful-agent
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
☆125Updated 2 years ago
ronghanghu / speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
☆138Updated 2 years ago
airsplay / R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆139Updated 4 years ago
allenai / savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
☆192Updated 6 years ago
Sha-Lab / babywalk
PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"
☆42Updated 3 years ago
lil-lab / touchdown
Cornell Touchdown natural language navigation and spatial reasoning dataset.
☆103Updated 5 years ago
facebookresearch / EmbodiedQA
Train embodied agents that can answer questions in environments
☆313Updated 2 years ago
lil-lab / ciff
Cornell Instruction Following Framework
☆34Updated 4 years ago
google-research / valan
Vision and Language Agent Navigation
☆82Updated 4 years ago
Cold-Winter / vqs
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
☆23Updated 8 years ago
nexusapoorvacus / DeepVariationStructuredRL
A PyTorch implementation of the "Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection" paper …
☆63Updated 6 years ago
lil-lab / drif
Dynamic Robot Instruction Following
☆37Updated 3 years ago
zilongzheng / visdial-gnn
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Updated 4 years ago
mmurray / cvdn
Cooperative Vision-and-Dialog Navigation
☆71Updated 2 years ago
batra-mlp-lab / visdial-rl
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
☆169Updated 7 years ago
devendrachaplot / DeepRL-Grounding
Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)
☆238Updated 7 years ago
rl-lang-grounding / rl-lang-ground
Tensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/ab…
☆17Updated 7 years ago
ronghanghu / snmn
Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018
☆71Updated 6 years ago
SamsonYuBaiJian / actionet
3D household task-based dataset created using customised AI2-THOR.
☆15Updated 3 years ago
allenai / cordial-sync
cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…
☆40Updated 4 years ago
gistvision / moca
Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…
☆39Updated last year
DmZhukov / CrossTask
☆93Updated 3 years ago
Glaciohound / VCML
PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019
☆47Updated 5 years ago
catalina17 / VideoNavQA
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
☆25Updated 3 years ago
satwikkottur / clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆49Updated 5 years ago
aimagelab / DynamicConv-agent
PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters
☆20Updated 2 years ago
lil-lab / chalet
Cornell House Agent Learning Environment
☆47Updated 3 years ago
MohitShridhar / ingress
Visual Grounding of Referring Expressions for Human-Robot Interaction
☆26Updated 7 years ago