facebookresearch / EmbodiedQALinks

Train embodied agents that can answer questions in environments

☆312

Alternatives and similar repositories for EmbodiedQA

Users that are interested in EmbodiedQA are comparing it to the libraries listed below

Sorting:

danielgordon10 / thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆126Updated 5 years ago
batra-mlp-lab / visdial-rl
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
☆169Updated 7 years ago
devendrachaplot / DeepRL-Grounding
Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)
☆238Updated 7 years ago
chihyaoma / selfmonitoring-agent
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
☆122Updated 2 years ago
ronghanghu / n2nmn
Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017
☆272Updated 5 years ago
ronghanghu / speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
☆137Updated 2 years ago
facebookresearch / talkthewalk
This repository provides code for reproducing experiments of the paper Talk The Walk: Navigating New York City Through Grounded Dialogue …
☆110Updated 4 years ago
HarshTrivedi / nmn-pytorch
Neural Module Network for VQA in Pytorch
☆107Updated 7 years ago
batra-mlp-lab / visdial-challenge-starter-pytorch
Starter code in PyTorch for the Visual Dialog challenge
☆190Updated 2 years ago
Kelym / FAST
Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"
☆61Updated 6 years ago
chihyaoma / regretful-agent
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
☆125Updated 2 years ago
facebookresearch / clevr-dataset-gen
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
☆631Updated 4 years ago
google-research / valan
Vision and Language Agent Navigation
☆82Updated 4 years ago
stanfordnlp / mac-network
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
☆508Updated 4 years ago
mesnico / RelationNetworks-CLEVR
A pytorch implementation for "A simple neural network module for relational reasoning", working on the CLEVR dataset
☆89Updated 5 years ago
airsplay / R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆136Updated 4 years ago
davidmascharka / tbd-nets
PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"
☆348Updated 3 years ago
batra-mlp-lab / visdial-amt-chat
[CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset
☆78Updated 3 years ago
kexinyi / ns-vqa
Neural-symbolic visual question answering
☆277Updated 2 years ago
Sha-Lab / babywalk
PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"
☆42Updated 3 years ago
abhshkdz / neural-vqa-attention
Attention-based Visual Question Answering in Torch
☆101Updated 8 years ago
facebookresearch / CoDraw
CoDraw dataset
☆93Updated 6 years ago
allenai / savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
☆192Updated 6 years ago
pathak22 / zeroshot-imitation
[ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration
☆203Updated 7 years ago
facebookresearch / habitat-challenge
Code for the habitat challenge
☆340Updated 2 years ago
markdtw / vqa-winner-cvprw-2017
Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17
☆163Updated 6 years ago
jiasenlu / visDial.pytorch
visual dialog model in pytorch
☆109Updated 7 years ago
lil-lab / drif
Dynamic Robot Instruction Following
☆36Updated 3 years ago
kdexd / probnmn-clevr
Code for ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]
☆67Updated 2 years ago
Cyanogenoid / vqa-counting
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆207Updated 6 years ago