AishwaryaAgrawal / GVQA

Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering

☆26

Alternatives and similar repositories for GVQA

Users who are interested in GVQA are comparing it to the repositories listed below.

shijx12 / XNM-Net
PyTorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs"
☆93 Updated 6 years ago
yuleiniu / rva
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64 Updated 2 years ago
cdancette / rubi.bootstrap.pytorch
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
☆65 Updated 4 years ago
fidler-lab / Caption-Lifetime-by-Asking-Questions
PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)
☆16 Updated 6 years ago
nocaps-org / updown-baseline
Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".
☆76 Updated 2 years ago
Seth-Park / MultimodalExplanations
Code release for Park et al., Multimodal Explanations: Justifying Decisions and Pointing to the Evidence, in CVPR 2018
☆48 Updated 7 years ago
ruotianluo / DiscCaptioning
Code for Discriminability objective for training descriptive captions (CVPR 2018)
☆109 Updated 6 years ago
hassanhub / MultiGrounding
This is the repo for Multi-level textual grounding
☆34 Updated 5 years ago
yiyang92 / vae_captioning
Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
☆59 Updated 7 years ago
Yusics / bist-parser
Scene Graph Parsing as Dependency Parsing
☆41 Updated 6 years ago
AmingWu / CCN
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15 Updated 4 years ago
aimbrain / vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆150 Updated 6 years ago
yuweijiang / HGL-pytorch
Code for the model "Heterogeneous Graph Learning for Visual Commonsense Reasoning (NeurIPS 2019)"
☆47 Updated 5 years ago
ruotianluo / cider
Python code for CIDEr - Consensus-based Image Caption Evaluation
☆32 Updated 6 years ago
yikang-li / iQAN
Visual Question Generation as Dual Task of Visual Question Answering (PyTorch Version)
☆82 Updated 7 years ago
Cyanogenoid / vqa-counting
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆207 Updated 6 years ago
jialinwu17 / self_critical_vqa
Code for NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"
☆41 Updated 6 years ago
YuanEZhou / Grounded-Image-Captioning
☆64 Updated 3 years ago
cvlab-tohoku / Dense-CoAttention-Network
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
☆107 Updated 6 years ago
lichengunc / speaker_listener_reinforcer
Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension
☆34 Updated 7 years ago
rakshithShetty / captionGAN
Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"
☆66 Updated 6 years ago
daqingliu / CAVP
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…
☆46 Updated 6 years ago
gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆45 Updated 2 years ago
ronghanghu / snmn
Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks, in ECCV 2018
☆71 Updated 6 years ago
bezorro / ACMN-Pytorch
Visual Question Reasoning on General Dependency Tree
☆30 Updated 7 years ago
SeleenaJM / CapEval
An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)
☆37 Updated 5 years ago
peteanderson80 / SPICE
Semantic Propositional Image Caption Evaluation
☆144 Updated 2 years ago
chihyaoma / cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46 Updated 5 years ago
fanchenyou / HME-VideoQA
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆54 Updated 4 years ago
lichengunc / refer-parser2
Referring Expression Parser
☆27 Updated 7 years ago