jialinwu17 / MAVEXView external linksLinks
☆30Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for MAVEX
Users that are interested in MAVEX are comparing it to the libraries listed below
Sorting:
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 3 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆31Apr 30, 2024Updated last year
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆100Mar 30, 2023Updated 2 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Sep 4, 2021Updated 4 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- ☆10Jul 23, 2021Updated 4 years ago
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)☆48Nov 3, 2022Updated 3 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"☆12Jun 30, 2023Updated 2 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆26Feb 9, 2024Updated 2 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI)☆20Oct 25, 2020Updated 5 years ago
- ☆18Dec 8, 2022Updated 3 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆69Jul 11, 2022Updated 3 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Feb 9, 2024Updated 2 years ago
- ☆22Aug 10, 2020Updated 5 years ago
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆105Apr 6, 2025Updated 10 months ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"☆187Apr 15, 2021Updated 4 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆22Nov 21, 2023Updated 2 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated last year
- The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…☆17Jan 24, 2025Updated last year
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago
- ☆40Nov 29, 2022Updated 3 years ago
- a multimodal retrieval dataset☆24Jul 8, 2023Updated 2 years ago
- ☆10Oct 14, 2020Updated 5 years ago
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- ☆44Jun 16, 2025Updated 8 months ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- Video Graph Transformer for Video Question Answering (ECCV'22)☆49Jun 8, 2023Updated 2 years ago
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering☆13Nov 23, 2022Updated 3 years ago
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆41Sep 9, 2019Updated 6 years ago
- [Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning☆54Feb 9, 2024Updated 2 years ago
- Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)☆29Aug 5, 2021Updated 4 years ago
- [NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering☆14Jan 5, 2024Updated 2 years ago