China-UK-ZSL / ZS-F-VQA
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
☆65Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for ZS-F-VQA
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆25Updated 9 months ago
- ☆37Updated last year
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 2 years ago
- Repository for VisualSem: a high-quality knowledge graph to support research in vision and language.☆87Updated 2 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI)☆19Updated 4 years ago
- Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)☆69Updated last year
- ☆19Updated 4 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆88Updated last year
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆90Updated 2 years ago
- Multimodal entity linking (MEL) aims to utilize multimodal information to map mentions to corresponding entities defined in knowledge bas…☆80Updated 3 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆56Updated 3 years ago
- ☆26Updated last year
- Multimodal entity linking for Tweets☆28Updated 3 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆116Updated 2 years ago
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆97Updated last year
- ☆10Updated 3 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆28Updated 6 months ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆26Updated 3 years ago
- Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…☆49Updated 2 years ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆22Updated 2 years ago
- Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance☆65Updated 3 weeks ago
- Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering☆48Updated 4 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆76Updated last year
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated last year
- [ACMMM 2022] Learning from Different text-image Pairs: A Relation-enhanced Graph Convolutional Network for Multimodal NER☆14Updated last year
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆19Updated 2 years ago
- 东南大学多模态知识图谱-OpenRichpedia工程文件☆28Updated 3 years ago
- ☆100Updated 2 years ago
- ☆32Updated last year