facebookresearch / MT-EQA
Multi-Target Embodied Question Answering
☆26Updated 5 years ago
Alternatives and similar repositories for MT-EQA
Users who are interested in MT-EQA are comparing it to the repositories listed below.
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Updated 7 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆49Updated 5 years ago
- BISON: Binary Image SelectiON☆49Updated 4 years ago
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeurIPS 2019☆47Updated 6 years ago
- Vision and Language Agent Navigation☆83Updated 4 years ago
- Visual Navigation with Natural Multimodal Assistance (EMNLP 2019)☆29Updated 5 years ago
- PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation☆125Updated 2 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Updated 6 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 9 years ago
- Website for TextVQA dataset.☆28Updated 2 years ago
- Generate a denotation graph from a set of image captions☆15Updated 7 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆55Updated 7 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks, in ECCV 2018☆71Updated 6 years ago
- For visual commonsense model☆34Updated 6 years ago
- This repository has moved to: https://github.com/tkipf/c-swm☆27Updated 6 years ago
- Dataset and documentation for paper on explaining solutions to physical reasoning tasks (ESPRIT)☆21Updated 8 months ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆19Updated 4 years ago
- Code for EmBERT, a transformer model for embodied, language-guided visual task completion.☆59Updated last year
- ☆24Updated 9 years ago
- ☆20Updated 4 years ago
- Cornell Touchdown natural language navigation and spatial reasoning dataset.☆105Updated 5 years ago
- Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)☆34Updated 8 years ago
- Repository for ACL 2020 paper "Refer360°: A Referring Expression Recognition Dataset in 360° Images"☆13Updated 4 years ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation☆122Updated 2 years ago
- ☆24Updated 4 years ago
- ☆16Updated 7 years ago
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆56Updated 4 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch☆85Updated 6 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆126Updated 5 years ago
- Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020☆42Updated 5 years ago