NeverMoreLCH/Awesome-VQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NeverMoreLCH/Awesome-VQA)

NeverMoreLCH / Awesome-VQA

A reading list of papers about Visual Question Answering.

☆36

Alternatives and similar repositories for Awesome-VQA

Users that are interested in Awesome-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Adam1679 / mutan-article-net
View on GitHub
Implementation of Mutan+ArticleNet on OKVQA
☆10Jan 11, 2021Updated 5 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
MILVLG / rosita
View on GitHub
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
☆57Jun 13, 2023Updated 3 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sergiotasconmorales / consistency_vqa
View on GitHub
Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)
☆26Mar 28, 2023Updated 3 years ago
AndersonStra / Mucko
View on GitHub
implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10Mar 17, 2022Updated 4 years ago
haifangong / CMSA-MTPT-4-MedicalVQA
View on GitHub
[ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention
☆34Dec 15, 2022Updated 3 years ago
jokieleung / awesome-visual-question-answering
View on GitHub
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Common…
☆672Jul 6, 2023Updated 3 years ago
guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
YuJungHeo / kbvqa-public
View on GitHub
☆40Nov 29, 2022Updated 3 years ago
Taaccoo / awesome-vqa-latest
View on GitHub
Visual Question Answering Paper List.
☆52Aug 19, 2022Updated 3 years ago
noagarcia / ROLL-VideoQA
View on GitHub
PyTorch code for ROLL, a knowledge-based video story question answering model.
☆21Sep 29, 2020Updated 5 years ago
zaynmi / seada-vqa
View on GitHub
A pytorch implemetation of data augmentation method for visual question answering
☆21May 25, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
The-AI-Summer / Hugging_Face_tutorials
View on GitHub
Hugging Face tutorials
☆15Jun 3, 2021Updated 5 years ago
CCYChongyanChen / VQA_AlgorithmDatasets
View on GitHub
☆37Jan 20, 2023Updated 3 years ago
AdrienLE / loss_kaggle_2018
View on GitHub
Investigation of focal and dice loss for the Kaggle 2018 data science bowl.
☆18Mar 6, 2018Updated 8 years ago
Axe-- / Visual-Question-Answering
View on GitHub
PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model
☆16Oct 3, 2023Updated 2 years ago
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 5 years ago
HLR / Cross_Modality_Relevance
View on GitHub
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27May 6, 2021Updated 5 years ago
abachaa / VQA-Med-2020
View on GitHub
VQA-Med 2020
☆16May 13, 2026Updated 2 months ago
wangpengnorman / FVQA
View on GitHub
☆22Aug 10, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sutdcv / SUTD-TrafficQA
View on GitHub
[CVPR 2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
☆66Feb 9, 2026Updated 5 months ago
cdancette / rubi.bootstrap.pytorch
View on GitHub
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
☆66Mar 29, 2021Updated 5 years ago
NeverMoreLCH / Awesome-Video-Grounding
View on GitHub
A reading list of papers about Visual Grounding.
☆31Aug 24, 2022Updated 3 years ago
yl3800 / EIGV
View on GitHub
☆15Aug 12, 2022Updated 3 years ago
China-UK-ZSL / ZS-F-VQA
View on GitHub
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
☆72Feb 9, 2024Updated 2 years ago
thaolmk54 / hcrn-videoqa
View on GitHub
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135Jul 25, 2024Updated 2 years ago
MILVLG / openvqa
View on GitHub
A lightweight, scalable, and general framework for visual question answering research
☆334Sep 3, 2021Updated 4 years ago
VirajBagal / MMBERT
View on GitHub
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
☆39Mar 22, 2021Updated 5 years ago
antoyang / just-ask
View on GitHub
[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
☆127Sep 29, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
weiweisong415 / Demo_DHCNN_for_TGRS2021
View on GitHub
A novel deep hashing method (DHCNN) for remote sensing image retrieval and classification, which was pulished in IEEE Trans. Geosci. Remo…
☆10Mar 23, 2022Updated 4 years ago
AwalkZY / CPN
View on GitHub
Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”
☆10Apr 3, 2022Updated 4 years ago
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
JerryWisdom / react-vqa-master
View on GitHub
基于 React + router + redux + axios 和 Flask + MySQL + Pytorch 的视觉问答管理系统
☆10Dec 12, 2022Updated 3 years ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
jlian2 / mucko
View on GitHub
Pytorch Implementation of MUCKO(2020 IJCAI)
☆20Oct 25, 2020Updated 5 years ago