facebookresearch / EmbodiedQA
Train embodied agents that can answer questions in environments
☆306 Updated last year
Alternatives and similar repositories for EmbodiedQA:
Users who are interested in EmbodiedQA are comparing it to the libraries listed below.
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch) ☆236 Updated 7 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments" ☆124 Updated 5 years ago
- Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018. ☆132 Updated 2 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning ☆169 Updated 6 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017 ☆271 Updated 4 years ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation ☆122 Updated last year
- Cornell Touchdown natural language navigation and spatial reasoning dataset. ☆99 Updated 4 years ago
- Vision and Language Agent Navigation ☆76 Updated 4 years ago
- Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971) ☆186 Updated 5 years ago
- PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout" ☆129 Updated 3 years ago
- This repository provides code for reproducing experiments of the paper Talk The Walk: Navigating New York City Through Grounded Dialogue … ☆110 Updated 3 years ago
- Starter code in PyTorch for the Visual Dialog challenge ☆192 Updated 2 years ago
- Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018) ☆501 Updated 3 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning ☆611 Updated 3 years ago
- Neural Module Network for VQA in Pytorch ☆107 Updated 7 years ago
- Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation" ☆61 Updated 5 years ago
- Neural-symbolic visual question answering ☆263 Updated 2 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration ☆203 Updated 6 years ago
- Code for the habitat challenge ☆325 Updated 2 years ago
- An open source framework for research in Embodied-AI from AI2. ☆344 Updated 3 months ago
- [ICLR 2018] Tensorflow/Keras code for Semi-parametric Topological Memory for Navigation ☆104 Updated 6 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension ☆294 Updated 2 years ago
- Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies ☆108 Updated 2 years ago
- [CVPR 2017] Torch code for Visual Dialog ☆228 Updated 6 years ago
- BabyAI platform. A testbed for training agents to understand and execute language commands. ☆727 Updated last year
- PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation ☆125 Updated last year
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019) ☆466 Updated 3 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset ☆79 Updated 2 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra… ☆90 Updated last year
- CoDraw dataset ☆93 Updated 6 years ago