☆30 · Dec 16, 2022 · Updated 3 years ago
Alternatives and similar repositories for MAVEX
Users that are interested in MAVEX are comparing it to the libraries listed below.
- Official implementation for the MM'22 paper. ☆14 · Jun 30, 2022 · Updated 3 years ago
- Code for the paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering ☆13 · Aug 22, 2023 · Updated 2 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering ☆31 · Apr 30, 2024 · Updated 2 years ago
- Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA" ☆12 · Jun 30, 2023 · Updated 2 years ago
- ☆15 · May 10, 2021 · Updated 4 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering ☆101 · Mar 30, 2023 · Updated 3 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering ☆65 · Sep 4, 2021 · Updated 4 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection ☆24 · Feb 9, 2024 · Updated 2 years ago
- Implementation of Mutan+ArticleNet on OKVQA ☆10 · Jan 11, 2021 · Updated 5 years ago
- PyTorch implementation of MUCKO (IJCAI 2020) ☆20 · Oct 25, 2020 · Updated 5 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge" ☆21 · May 8, 2023 · Updated 3 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language" ☆70 · Jul 11, 2022 · Updated 3 years ago
- ☆10 · Jul 23, 2021 · Updated 4 years ago
- ☆13 · Aug 14, 2022 · Updated 3 years ago
- ☆27 · Oct 7, 2021 · Updated 4 years ago
- The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens… ☆18 · Jan 24, 2025 · Updated last year
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22) ☆49 · Apr 22, 2026 · Updated 2 weeks ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph ☆72 · Feb 9, 2024 · Updated 2 years ago
- ☆22 · Aug 10, 2020 · Updated 5 years ago
- ☆19 · Dec 8, 2022 · Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog" ☆13 · May 12, 2023 · Updated 2 years ago
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering ☆105 · Apr 6, 2025 · Updated last year
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution" ☆10 · Nov 1, 2022 · Updated 3 years ago
- A multimodal retrieval dataset ☆24 · Jul 8, 2023 · Updated 2 years ago
- Code for NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering" ☆41 · Sep 9, 2019 · Updated 6 years ago
- This is the official repository for Retrieval Augmented Visual Question Answering ☆248 · Dec 19, 2024 · Updated last year
- Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering" ☆187 · Apr 15, 2021 · Updated 5 years ago
- This repository contains code used in our ACL'20 paper "History for Visual Dialog: Do we really need it?" ☆33 · Mar 24, 2023 · Updated 3 years ago
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering ☆13 · Nov 23, 2022 · Updated 3 years ago
- 🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer" ☆13 · Feb 1, 2023 · Updated 3 years ago
- Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020) ☆29 · Aug 5, 2021 · Updated 4 years ago
- [Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning ☆54 · Feb 9, 2024 · Updated 2 years ago
- Learning Situation Hyper-Graphs for Video Question Answering ☆23 · Feb 16, 2024 · Updated 2 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22) ☆49 · Jun 8, 2023 · Updated 2 years ago
- ☆40 · Nov 29, 2022 · Updated 3 years ago
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation" ☆22 · Nov 21, 2023 · Updated 2 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev… ☆38 · Dec 19, 2024 · Updated last year
- ☆15 · Aug 13, 2020 · Updated 5 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021 ☆19 · Jul 27, 2021 · Updated 4 years ago