danielgordon10 / thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆126Feb 11, 2020Updated 6 years ago
Alternatives and similar repositories for thor-iqa-cvpr-2018
Users who are interested in thor-iqa-cvpr-2018 compare it to the repositories listed below
- Train embodied agents that can answer questions in environments☆316Jul 25, 2023Updated 2 years ago
- Multi-Target Embodied Question Answering☆26Jul 17, 2020Updated 5 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Jul 2, 2020Updated 5 years ago
- Models and code for the paper "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions"☆14Aug 6, 2018Updated 7 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Sep 1, 2018Updated 7 years ago
- Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)☆195Feb 5, 2026Updated last week
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆181Oct 25, 2022Updated 3 years ago
- An open-source platform for Visual AI.☆1,657Nov 4, 2025Updated 3 months ago
- Pytorch implementation of the winner of the VQA Challenge Workshop at CVPR'17☆163Feb 8, 2019Updated 7 years ago
- Visual Question Generation as Dual Task of Visual Question Answering (PyTorch Version)☆82Jun 15, 2018Updated 7 years ago
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)☆238Apr 16, 2018Updated 7 years ago
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆56Oct 29, 2021Updated 4 years ago
- Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.☆138Nov 22, 2022Updated 3 years ago
- MINOS: Multimodal Indoor Simulator☆203Jan 11, 2023Updated 3 years ago
- Cooperative Vision-and-Dialog Navigation☆72Nov 22, 2022Updated 3 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering☆207Mar 5, 2019Updated 6 years ago
- Code for the COG dataset and network☆44Oct 17, 2018Updated 7 years ago
- [ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering☆132Oct 25, 2022Updated 3 years ago
- Website for TextVQA dataset.☆28Apr 30, 2023Updated 2 years ago
- 3D household task-based dataset created using customised AI2-THOR.☆14Apr 14, 2022Updated 3 years ago
- RoboTHOR Challenge☆97May 17, 2021Updated 4 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset☆78Jun 10, 2022Updated 3 years ago
- This repository contains code for the paper RMM: A Recursive Mental Model for Dialog Navigation☆10Nov 22, 2022Updated 3 years ago
- Contrast between ShuffleNet V2 and MnasNet. (Unofficial implementation in PyTorch)☆12Oct 25, 2018Updated 7 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆643Aug 30, 2021Updated 4 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆297Nov 29, 2022Updated 3 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31May 30, 2018Updated 7 years ago
- ☆19Feb 6, 2019Updated 7 years ago
- Code for "Controllable Video Generation with Sparse Trajectories" in PyTorch☆45May 14, 2018Updated 7 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 2 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017☆272Jul 30, 2020Updated 5 years ago
- Pytorch implementation of Yolo V3☆11Aug 30, 2018Updated 7 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆765Mar 10, 2024Updated last year
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 7 years ago
- This repository provides code for reproducing experiments of the paper Talk The Walk: Navigating New York City Through Grounded Dialogue …☆110Aug 12, 2021Updated 4 years ago
- Neural-symbolic visual question answering☆280Mar 27, 2023Updated 2 years ago
- An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)☆25Jun 22, 2022Updated 3 years ago
- Visual Question Answering in Pytorch☆734Dec 11, 2019Updated 6 years ago