chingyaoc/awesome-vqa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chingyaoc/awesome-vqa)

chingyaoc / awesome-vqa

Visual Q&A reading list

☆439

Alternatives and similar repositories for awesome-vqa

Users that are interested in awesome-vqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cadene / vqa.pytorch
View on GitHub
Visual Question Answering in Pytorch
☆733Dec 11, 2019Updated 6 years ago
jiasenlu / HieCoAttenVQA
View on GitHub
☆351Oct 2, 2018Updated 7 years ago
chingyaoc / VQA-tensorflow
View on GitHub
Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering
☆98Apr 27, 2017Updated 9 years ago
akirafukui / vqa-mcb
View on GitHub
☆219Aug 13, 2016Updated 9 years ago
hengyuan-hu / bottom-up-attention-vqa
View on GitHub
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768Mar 10, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zhoubolei / VQAbaseline
View on GitHub
Simple Baseline for Visual Question Answering
☆186Dec 21, 2016Updated 9 years ago
GT-Vision-Lab / VQA_LSTM_CNN
View on GitHub
Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…
☆386Mar 22, 2019Updated 7 years ago
markdtw / vqa-winner-cvprw-2017
View on GitHub
Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17
☆163Feb 8, 2019Updated 7 years ago
peteanderson80 / bottom-up-attention
View on GitHub
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆1,470Feb 3, 2023Updated 3 years ago
zcyang / imageqa-san
View on GitHub
code for Stacked attention networks for image question answering
☆108Jan 7, 2017Updated 9 years ago
abhshkdz / neural-vqa-attention
View on GitHub
Attention-based Visual Question Answering in Torch
☆101Aug 13, 2017Updated 8 years ago
GT-Vision-Lab / VQA
View on GitHub
☆392Mar 11, 2021Updated 5 years ago
iamaaditya / VQA_Demo
View on GitHub
Visual Question Answering Demo on pretrained model
☆248Oct 31, 2025Updated 8 months ago
jokieleung / awesome-visual-question-answering
View on GitHub
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Common…
☆672Jul 6, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
chingyaoc / san-torch
View on GitHub
Torch implementation for Stacked Attention Networks
☆23Nov 24, 2016Updated 9 years ago
Cadene / block.bootstrap.pytorch
View on GitHub
BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models
☆354Dec 4, 2019Updated 6 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
View on GitHub
A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300Jan 6, 2026Updated 6 months ago
hexiang-hu / answer_embedding
View on GitHub
Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)
☆13Apr 6, 2019Updated 7 years ago
jnhwkim / ban-vqa
View on GitHub
Bilinear attention networks for visual question answering
☆549Oct 30, 2023Updated 2 years ago
imatge-upc / vqa-2016-cvprw
View on GitHub
Visual question answering for CVPR16 VQA Challenge.
☆41Nov 5, 2016Updated 9 years ago
makarandtapaswi / MovieQA_CVPR2016
View on GitHub
Contains approaches introduced in the MovieQA benchmark dataset paper
☆78Nov 30, 2016Updated 9 years ago
aimbrain / vqa-project
View on GitHub
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆150Mar 11, 2019Updated 7 years ago
HyeonwooNoh / DPPnet
View on GitHub
DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction
☆96Apr 20, 2016Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YunseokJANG / tgif-qa
View on GitHub
Repository for our CVPR 2017 and IJCV: TGIF-QA
☆180Sep 6, 2021Updated 4 years ago
peteanderson80 / SPICE
View on GitHub
Semantic Propositional Image Caption Evaluation
☆149Feb 2, 2023Updated 3 years ago
Cyanogenoid / vqa-counting
View on GitHub
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆208Mar 5, 2019Updated 7 years ago
shtechair / vqa-sva
View on GitHub
Structured Attentions for Visual Question Answering
☆46Mar 4, 2018Updated 8 years ago
anantzoid / VQA-Keras-Visual-Question-Answering
View on GitHub
Visual Question Answering task written in Keras that answers questions about images
☆156May 10, 2019Updated 7 years ago
jnhwkim / nips-mrn-vqa
View on GitHub
Multimodal Residual Learning for Visual QA (NIPS 2016)
☆39Dec 27, 2016Updated 9 years ago
paarthneekhara / neural-vqa-tensorflow
View on GitHub
Visual Question Answering in Tensorflow.
☆229Nov 19, 2019Updated 6 years ago
Cyanogenoid / pytorch-vqa
View on GitHub
Strong baseline for visual question answering
☆240Mar 13, 2023Updated 3 years ago
Cadene / murel.bootstrap.pytorch
View on GitHub
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
☆194Feb 9, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
facebookresearch / mmf
View on GitHub
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,635Jul 7, 2026Updated 2 weeks ago
DeepRNN / visual_question_answering
View on GitHub
Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"
☆79Mar 22, 2018Updated 8 years ago
MILVLG / mcan-vqa
View on GitHub
Deep Modular Co-Attention Networks for Visual Question Answering
☆459Dec 16, 2020Updated 5 years ago
SinghJasdeep / Attention-on-Attention-for-VQA
View on GitHub
Visual Question Answering Project with state of the art single Model performance.
☆130Jun 18, 2018Updated 8 years ago
jnhwkim / MulLowBiVQA
View on GitHub
Hadamard Product for Low-rank Bilinear Pooling
☆72Nov 6, 2017Updated 8 years ago
MILVLG / openvqa
View on GitHub
A lightweight, scalable, and general framework for visual question answering research
☆334Sep 3, 2021Updated 4 years ago
liuzhi136 / Visual-Question-Answering
View on GitHub
☆42Aug 18, 2016Updated 9 years ago