Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆126Feb 11, 2020Updated 6 years ago
Alternatives and similar repositories for thor-iqa-cvpr-2018
Users that are interested in thor-iqa-cvpr-2018 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train embodied agents that can answer questions in environments☆316Jul 25, 2023Updated 2 years ago
- Multi-Target Embodied Question Answering☆26Jul 17, 2020Updated 5 years ago
- An open-source platform for Visual AI.☆1,691Nov 4, 2025Updated 4 months ago
- 3D household task-based dataset created using customised AI2-THOR.☆14Apr 14, 2022Updated 3 years ago
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)☆238Apr 16, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository contains code for the paper RMM: A Recursive Mental Model for Dialog Navigation☆10Nov 22, 2022Updated 3 years ago
- Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)☆194Feb 20, 2026Updated last month
- RoboTHOR Challenge☆97May 17, 2021Updated 4 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Jul 2, 2020Updated 5 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆163Feb 8, 2019Updated 7 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Aug 6, 2018Updated 7 years ago
- Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.☆137Nov 22, 2022Updated 3 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆182Oct 25, 2022Updated 3 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering☆207Mar 5, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 8 years ago
- MINOS: Multimodal Indoor Simulator☆203Jan 11, 2023Updated 3 years ago
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation☆23Aug 1, 2017Updated 8 years ago
- An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)☆25Jun 22, 2022Updated 3 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Sep 1, 2018Updated 7 years ago
- ☆13Dec 8, 2022Updated 3 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 2 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆645Aug 30, 2021Updated 4 years ago
- Code for the COG dataset and network☆44Oct 17, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks☆501Feb 5, 2026Updated last month
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆56Oct 29, 2021Updated 4 years ago
- Neural-symbolic visual question answering☆280Mar 27, 2023Updated 3 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆298Nov 29, 2022Updated 3 years ago
- This repository provides code for reproducing experiments of the paper Talk The Walk: Navigating New York City Through Grounded Dialogue …☆110Aug 12, 2021Updated 4 years ago
- Cooperative Vision-and-Dialog Navigation☆72Nov 22, 2022Updated 3 years ago
- Generate captions for an image using convolutional and recurrent networks☆12Feb 25, 2016Updated 10 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017☆272Jul 30, 2020Updated 5 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆766Mar 10, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni…☆24Jun 28, 2021Updated 4 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆49Feb 18, 2020Updated 6 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset☆78Jun 10, 2022Updated 3 years ago
- a Realistic and Rich 3D Environment☆1,202Jul 6, 2020Updated 5 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆69Jan 2, 2017Updated 9 years ago
- Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)☆82Jun 15, 2018Updated 7 years ago
- PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"☆144Oct 23, 2021Updated 4 years ago