shubhamagarwal92 / visdial_conv

This repository contains code used in our ACL'20 paper "History for Visual Dialog: Do we really need it?"

☆34

Alternatives and similar repositories for visdial_conv

Users who are interested in visdial_conv are comparing it to the repositories listed below.

idansc / mrr-ndcg
☆18 Updated last year
e-bug / volta
[TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…
☆114 Updated 3 years ago
simpleshinobu / visdial-principles
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆31 Updated 2 years ago
HKUST-KnowComp / VD-PCR
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10 Updated 3 years ago
HKUST-KnowComp / Visual_PCR
Dataset and Source code for EMNLP 2019 paper "What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues"
☆26 Updated 4 years ago
salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11 Updated 5 months ago
vmurahari3 / visdial-bert
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
☆97 Updated 5 years ago
yanxinzju / CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
☆79 Updated 3 years ago
hwanheelee1993 / ViLBERTScore
Code for ViLBERTScore in EMNLP Eval4NLP
☆18 Updated 3 years ago
yuleiniu / cfvqa
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆126 Updated 3 years ago
wh0330 / CAG_VisDial
☆15 Updated 5 years ago
hwanheelee1993 / UMIC
An unreferenced image captioning metric (ACL-21)
☆30 Updated last year
necla-ml / SNLI-VE
Dataset and starting code for visual entailment dataset
☆118 Updated 3 years ago
salesforce / VD-BERT
☆44 Updated 5 months ago
gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆45 Updated 2 years ago
chrisc36 / bottom-up-attention-vqa
BottomUpTopDown VQA model with question-type debiasing
☆22 Updated 6 years ago
CrossmodalGroup / SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
☆52 Updated 5 years ago
yuleiniu / rva
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64 Updated 2 years ago
phellonchen / DMRM
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
☆25 Updated 3 years ago
wenhuchen / Meta-Module-Network
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43 Updated 4 years ago
WadeYin9712 / GD-VCR
Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).
☆29 Updated 4 years ago
mitjanikolaus / compositional-image-captioning
Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Ar…
☆26 Updated 5 years ago
cdancette / vqa-cp-leaderboard
A collection of papers about VQA-CP datasets and their results
☆41 Updated 3 years ago
cooelf / UVR-NMT
Neural Machine Translation with universal Visual Representation (ICLR 2020)
☆89 Updated 5 years ago
Zhiquan-Wen / D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆27 Updated 3 years ago
maximek3 / e-ViL
☆40 Updated 3 years ago
volkancirik / groundnet
Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition"
☆13 Updated 5 years ago
badripatro / awesome-vqg
Visual Question Generation reading list
☆29 Updated 5 years ago
ck0123 / improved-bertscore-for-image-captioning-evaluation
☆21 Updated last year
quangvnai / visdial
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
☆29 Updated 4 years ago