alirezasalemi7 / DEDR-MM-FiD
the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆11Updated last year
Related projects: ⓘ
- ☆12Updated last year
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆25Updated 7 months ago
- Official implementation for the MM'22 paper.☆11Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆87Updated last year
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'☆19Updated 9 months ago
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)☆43Updated last year
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆22Updated last year
- Video Graph Transformer for Video Question Answering (ECCV'22)☆44Updated last year
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆65Updated 2 years ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆21Updated 5 months ago
- ☆26Updated last year
- Recent Advances in Visual Dialog☆29Updated 2 years ago
- a multimodal retrieval dataset☆21Updated last year
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆19Updated 2 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Updated 2 years ago
- Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training☆18Updated last year
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.☆12Updated 8 months ago
- ☆23Updated last year
- ☆10Updated this week
- Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".☆41Updated 2 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI)☆19Updated 3 years ago
- ☆11Updated 2 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Updated last year
- Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning"☆12Updated last year
- An PyTorch reimplementation of bottom-up-attention models☆16Updated 3 years ago
- ☆18Updated last year
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆15Updated 4 months ago
- NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)☆120Updated last month
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21Updated last year
- ☆77Updated last year