jialinwu17/MAVEX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jialinwu17/MAVEX)

jialinwu17 / MAVEX

☆30

Alternatives and similar repositories for MAVEX

Users that are interested in MAVEX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
alirezasalemi7 / DEDR-MM-FiD
View on GitHub
the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆14Aug 22, 2023Updated 2 years ago
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
PhoebusSi / Thinking-while-Observing
View on GitHub
Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"
☆12Jun 30, 2023Updated 3 years ago
AndersonStra / MuKEA
View on GitHub
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101Mar 30, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
codexxxl / GraphVQA
View on GitHub
GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering
☆65Sep 4, 2021Updated 4 years ago
hackerchenzhuo / LaKo
View on GitHub
[Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
☆24Feb 9, 2024Updated 2 years ago
jingchenchen / ReasoningConsistency-VQA
View on GitHub
☆13Aug 14, 2022Updated 3 years ago
Adam1679 / mutan-article-net
View on GitHub
Implementation of Mutan+ArticleNet on OKVQA
☆10Jan 11, 2021Updated 5 years ago
guilk / KAT
View on GitHub
Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"
☆71Jul 11, 2022Updated 4 years ago
aditya10 / VLC-BERT
View on GitHub
Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"
☆21May 8, 2023Updated 3 years ago
maryamziaa / ConceptBERT
View on GitHub
☆10Jul 23, 2021Updated 5 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
HITsz-TMG / Cognitive-Visual-Language-Mapper
View on GitHub
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17Jan 24, 2025Updated last year
China-UK-ZSL / ZS-F-VQA
View on GitHub
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
☆72Feb 9, 2024Updated 2 years ago
aioz-ai / CFR_VQA
View on GitHub
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
☆48Apr 22, 2026Updated 3 months ago
wangpengnorman / FVQA
View on GitHub
☆22Aug 10, 2020Updated 5 years ago
luomancs / retriever_reader_for_okvqa
View on GitHub
☆19Dec 8, 2022Updated 3 years ago
yuanze-lin / REVIVE
View on GitHub
[NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
☆105Apr 6, 2025Updated last year
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
jialinwu17 / self_critical_vqa
View on GitHub
Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''
☆40Sep 9, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
luomancs / ReMuQ
View on GitHub
a multimodal retrieval dataset
☆25Jul 8, 2023Updated 3 years ago
LinWeizheDragon / Retrieval-Augmented-Visual-Question-Answering
View on GitHub
This is the official repository for Retrieval Augmented Visual Question Answering
☆252Dec 19, 2024Updated last year
UMass-Embodied-AGI / VisualCoT
View on GitHub
Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning
☆40Mar 12, 2025Updated last year
quangvnai / visdial
View on GitHub
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
☆29Aug 5, 2021Updated 4 years ago
shubhamagarwal92 / visdial_conv
View on GitHub
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
☆33Mar 24, 2023Updated 3 years ago
ItemZheng / KDDAug
View on GitHub
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13Nov 23, 2022Updated 3 years ago
gicheonkang / sglkt-visdial
View on GitHub
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
☆13Feb 1, 2023Updated 3 years ago
aurooj / SHG-VQA
View on GitHub
Learning Situation Hyper-Graphs for Video Question Answering
☆23Feb 16, 2024Updated 2 years ago
sail-sg / VGT
View on GitHub
Video Graph Transformer for Video Question Answering (ECCV'22)
☆49Jun 8, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
YuJungHeo / kbvqa-public
View on GitHub
☆40Nov 29, 2022Updated 3 years ago
szzexpoi / rex
View on GitHub
Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"
☆22Nov 21, 2023Updated 2 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 5 years ago
PaulLerner / ViQuAE
View on GitHub
Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…
☆39Dec 19, 2024Updated last year
wh0330 / CAG_VisDial
View on GitHub
☆15Aug 13, 2020Updated 5 years ago
AndersonStra / Mucko
View on GitHub
implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10Mar 17, 2022Updated 4 years ago
microsoft / PICa
View on GitHub
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆88Apr 10, 2022Updated 4 years ago