prdwb/okvqa-release

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/prdwb/okvqa-release)

prdwb / okvqa-release

☆15

Alternatives and similar repositories for okvqa-release

Users that are interested in okvqa-release are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
Adam1679 / mutan-article-net
View on GitHub
Implementation of Mutan+ArticleNet on OKVQA
☆10Jan 11, 2021Updated 5 years ago
luomancs / retriever_reader_for_okvqa
View on GitHub
☆19Dec 8, 2022Updated 3 years ago
yyyanglz / KAN
View on GitHub
Rich Visual Knowledge-based AugmentationNetwork for Visual Question Answering
☆10Dec 6, 2019Updated 6 years ago
jialinwu17 / MAVEX
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 4 years ago
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
jingjing12110 / MixPHM
View on GitHub
[CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering
☆17Jul 11, 2023Updated 3 years ago
val-iisc / RMLVQA
View on GitHub
☆19May 31, 2023Updated 3 years ago
NeverMoreLCH / Awesome-VQA
View on GitHub
A reading list of papers about Visual Question Answering.
☆35Aug 17, 2022Updated 3 years ago
haifangong / CMSA-MTPT-4-MedicalVQA
View on GitHub
[ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention
☆34Dec 15, 2022Updated 3 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
XMUVQA / CapsAtt
View on GitHub
Project for Dynamic Capsule Attention
☆12Dec 7, 2019Updated 6 years ago
BierOne / relation-vqa
View on GitHub
Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.
☆12Mar 13, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
sanket0211 / WK-VQA
View on GitHub
World Knowledge Based Visual Question Answering
☆22Nov 26, 2020Updated 5 years ago
salesforce / VD-BERT
View on GitHub
☆45Jun 16, 2025Updated last year
aioz-ai / CFR_VQA
View on GitHub
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
☆49Apr 22, 2026Updated 3 months ago
Cloud-CV / vilbert-multi-task
View on GitHub
12-in-1: Multi-Task Vision and Language Representation Learning Web Demo
☆35Dec 8, 2022Updated 3 years ago
BierOne / Attention-Faithfulness
View on GitHub
[ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…
☆20Jul 21, 2022Updated 4 years ago
zongshenmu / attention_knowledge_vqa
View on GitHub
vqa drived by bottom-up and top-down attention and knowledge
☆14Nov 21, 2018Updated 7 years ago
wangpengnorman / FVQA
View on GitHub
☆22Aug 10, 2020Updated 5 years ago
jokieleung / CL-VQA
View on GitHub
the implementation of EMNLP 2020 "Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering"
☆15Sep 9, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
alirezasalemi7 / DEDR-MM-FiD
View on GitHub
the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆14Aug 22, 2023Updated 2 years ago
AndersonStra / MuKEA
View on GitHub
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101Mar 30, 2023Updated 3 years ago
Awenbocc / med-vqa
View on GitHub
Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]
☆64Aug 20, 2021Updated 4 years ago
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago
alibabadoufu / dynamic_fusion_reimplementation
View on GitHub
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆17Oct 30, 2019Updated 6 years ago
sergiotasconmorales / consistency_vqa
View on GitHub
Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)
☆26Mar 28, 2023Updated 3 years ago
JoshuaGhost / expred
View on GitHub
☆10Jul 24, 2023Updated 2 years ago
yangxuntu / lxmertcatt
View on GitHub
☆79Oct 8, 2022Updated 3 years ago
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jlian2 / mucko
View on GitHub
Pytorch Implementation of MUCKO(2020 IJCAI)
☆20Oct 25, 2020Updated 5 years ago
microsoft / PICa
View on GitHub
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆88Apr 10, 2022Updated 4 years ago
shengyuzhang / DeVLBert
View on GitHub
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
☆27Nov 27, 2022Updated 3 years ago
allenai / x-lxmert
View on GitHub
PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"
☆50Aug 27, 2021Updated 4 years ago
jiasenlu / bottom-up-attention
View on GitHub
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆23Aug 22, 2019Updated 6 years ago
naoya-i / r4c
View on GitHub
r4c
☆14Mar 2, 2021Updated 5 years ago
hotaki-lab / Product-Review-Sentiment-Analysis
View on GitHub
The goal of this project is to design a classifier to use for sentiment analysis of product reviews. Our training set consists of reviews…
☆10Jul 8, 2021Updated 5 years ago