arijitray1993 / VQARelevance
Models and code for the paper "Question Relevance in VQA: Identifying Non-Visual and False-Premise Questions"
☆15Updated 6 years ago
Related projects
Alternatives and complementary repositories for VQARelevance
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 7 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆44Updated 4 years ago
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Updated 5 years ago
- For visual commonsense model☆34Updated 5 years ago
- Multi-Target Embodied Question Answering☆11Updated 5 years ago
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeurIPS 2019☆49Updated 4 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆32Updated 5 years ago
- ☆15Updated 6 years ago
- Code release for Hu et al., "Explainable Neural Computation via Stack Neural Module Networks", ECCV 2018☆72Updated 4 years ago
- Implementation of n2nmn in PyTorch☆19Updated 5 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations☆42Updated 3 years ago
- This repository has moved to: https://github.com/tkipf/c-swm☆27Updated 4 years ago
- Visual Navigation with Natural Multimodal Assistance (EMNLP 2019)☆27Updated 4 years ago
- Dataset and documentation for paper on explaining solutions to physical reasoning tasks (ESPRIT)☆21Updated 3 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆57Updated 6 years ago
- Code release for Park et al., "Multimodal Explanations: Justifying Decisions and Pointing to the Evidence", CVPR 2018☆49Updated 6 years ago
- Learning with latent language☆50Updated 3 years ago
- Scene Graph Parsing as Dependency Parsing☆41Updated 5 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15Updated 3 years ago
- Multi-Target Embodied Question Answering☆26Updated 4 years ago
- Repository for our ICLR 2018 paper: memoryGAN☆47Updated 6 years ago
- ☆29Updated 6 years ago
- Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)☆34Updated 7 years ago
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing"☆12Updated 4 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning and triplet attention, are very general and can be…☆20Updated 6 years ago