LibertFan / TCICLinks

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning in IJCAI2021.

☆9

Alternatives and similar repositories for TCIC

Users that are interested in TCIC are comparing it to the libraries listed below

Sorting:

CrossmodalGroup / SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
☆51Updated 4 years ago
yanxinzju / CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
☆78Updated 2 years ago
Zhiquan-Wen / D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆25Updated 2 years ago
daqingliu / NMTree
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆39Updated 5 years ago
HLR / Cross_Modality_Relevance
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27Updated 4 years ago
ezeli / BUTD_model
A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.
☆47Updated 3 years ago
yuleiniu / cfvqa
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆123Updated 3 years ago
CCYChongyanChen / VQA_AlgorithmDatasets
☆38Updated 2 years ago
entalent / MemCap
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Updated 5 years ago
wh0330 / CAG_VisDial
☆15Updated 4 years ago
qinzzz / Multimodal-Alignment-Framework
Implementation for MAF: Multimodal Alignment Framework
☆46Updated 4 years ago
AndersonStra / MuKEA
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆96Updated 2 years ago
cdancette / vqa-cp-leaderboard
A collections of papers about VQA-CP datasets and their results
☆39Updated 3 years ago
simpleshinobu / visdial-principles
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆32Updated 2 years ago
shubhamagarwal92 / visdial_conv
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
☆34Updated 2 years ago
zyang-ur / onestage_grounding
A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)
☆148Updated 4 years ago
jlian2 / mucko
Pytorch Implementation of MUCKO(2020 IJCAI)
☆20Updated 4 years ago
YiwuZhong / Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆97Updated 11 months ago
chrisc36 / bottom-up-attention-vqa
BottomUpTopDown VQA model with question-type debiasing
☆22Updated 5 years ago
jialinwu17 / MAVEX
☆30Updated 2 years ago
ruotianluo / coco-caption
☆67Updated 2 years ago
jokieleung / CL-VQA
the implementation of EMNLP 2020 "Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering"
☆15Updated 3 years ago
mad-red / VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Updated 3 years ago
linjieli222 / VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆184Updated 4 years ago
PhoebusSi / SAR
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
☆31Updated 3 years ago
wangpengnorman / FVQA
☆20Updated 4 years ago
salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11Updated last month
idansc / mrr-ndcg
☆18Updated last year
SpencerWhitehead / novelvqa
☆27Updated 3 years ago
aioz-ai / ICCV19_VQA-CTI
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
☆38Updated 2 years ago