GT-Vision-Lab/VQA_LSTM_CNN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GT-Vision-Lab/VQA_LSTM_CNN)

GT-Vision-Lab / VQA_LSTM_CNN

Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.

☆386

Alternatives and similar repositories for VQA_LSTM_CNN

Users that are interested in VQA_LSTM_CNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jiasenlu / HieCoAttenVQA
View on GitHub
☆351Oct 2, 2018Updated 7 years ago
zhoubolei / VQAbaseline
View on GitHub
Simple Baseline for Visual Question Answering
☆186Dec 21, 2016Updated 9 years ago
GT-Vision-Lab / VQA
View on GitHub
☆392Mar 11, 2021Updated 5 years ago
akirafukui / vqa-mcb
View on GitHub
☆219Aug 13, 2016Updated 9 years ago
chingyaoc / VQA-tensorflow
View on GitHub
Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering
☆98Apr 27, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HyeonwooNoh / DPPnet
View on GitHub
DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction
☆96Apr 20, 2016Updated 10 years ago
avisingh599 / visual-qa
View on GitHub
[Reimplementation Antol et al 2015] Keras-based LSTM/CNN models for Visual Question Answering
☆479Jun 11, 2018Updated 8 years ago
chingyaoc / san-torch
View on GitHub
Torch implementation for Stacked Attention Networks
☆23Nov 24, 2016Updated 9 years ago
chingyaoc / awesome-vqa
View on GitHub
Visual Q&A reading list
☆439Oct 7, 2018Updated 7 years ago
renmengye / imageqa-public
View on GitHub
Code for paper "Exploring Models and Data for Image Question Answering"
☆81Mar 23, 2016Updated 10 years ago
zcyang / imageqa-san
View on GitHub
code for Stacked attention networks for image question answering
☆108Jan 7, 2017Updated 9 years ago
jnhwkim / MulLowBiVQA
View on GitHub
Hadamard Product for Low-rank Bilinear Pooling
☆72Nov 6, 2017Updated 8 years ago
Cadene / vqa.pytorch
View on GitHub
Visual Question Answering in Pytorch
☆733Dec 11, 2019Updated 6 years ago
jacobandreas / nmn2
View on GitHub
Neural module networks
☆401Jul 7, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
anantzoid / VQA-Keras-Visual-Question-Answering
View on GitHub
Visual Question Answering task written in Keras that answers questions about images
☆156May 10, 2019Updated 7 years ago
iamaaditya / VQA_Demo
View on GitHub
Visual Question Answering Demo on pretrained model
☆248Oct 31, 2025Updated 8 months ago
yukezhu / visual7w-qa-models
View on GitHub
Visual7W visual question answering models
☆65Oct 8, 2019Updated 6 years ago
mateuszmalinowski / visual_turing_test-tutorial
View on GitHub
Tutorial for Visual Turing Test (visual question answering, image question answering).
☆118Jan 30, 2017Updated 9 years ago
coreylynch / grid-lstm
View on GitHub
Torch7 implementation of Grid LSTM as described here: http://arxiv.org/pdf/1507.01526v2.pdf
☆186Feb 10, 2016Updated 10 years ago
yangky11 / CNN-Color2Gray
View on GitHub
An implementation of Color2Gray with convolutional neural networks
☆11Dec 23, 2015Updated 10 years ago
hengyuan-hu / bottom-up-attention-vqa
View on GitHub
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768Mar 10, 2024Updated 2 years ago
abhshkdz / neural-vqa
View on GitHub
Visual Question Answering in Torch
☆486May 3, 2016Updated 10 years ago
SinghJasdeep / Attention-on-Attention-for-VQA
View on GitHub
Visual Question Answering Project with state of the art single Model performance.
☆130Jun 18, 2018Updated 8 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
deltheil / vlfeat.torch
View on GitHub
VLFeat (partial) FFI wrapper for Torch7
☆12Mar 23, 2016Updated 10 years ago
jnhwkim / cbp
View on GitHub
Multimodal Compact Bilinear Pooling for Torch7
☆70Jan 2, 2017Updated 9 years ago
taey16 / image-encoder
View on GitHub
image encoder
☆13Sep 19, 2016Updated 9 years ago
yueatsprograms / Stochastic_Depth
View on GitHub
Deep Networks with Stochastic Depth
☆479Aug 13, 2018Updated 7 years ago
willwhitney / understanding-visual-concepts
View on GitHub
Unsupervised learning of visual concepts from video
☆56May 5, 2016Updated 10 years ago
Cyanogenoid / vqa-counting
View on GitHub
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆208Mar 5, 2019Updated 7 years ago
GT-Vision-Lab / abstract_scenes_v002
View on GitHub
The second version of the interface for Abstract Scenes research project.
☆23May 16, 2022Updated 4 years ago
ruotianluo / Faster-RCNN-Densecap-torch
View on GitHub
Faster-RCNN based on Densecap(deprecated)
☆84Sep 12, 2016Updated 9 years ago
yukezhu / visual7w-toolkit
View on GitHub
Toolkit for Visual7W visual question answering dataset
☆80Oct 8, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
edward-zhu / umaru
View on GitHub
An OCR-system based on torch using the technique of LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm.
☆66Oct 27, 2015Updated 10 years ago
iassael / torch-dropconnect
View on GitHub
Torch7 implementation of "Regularization of Neural Networks using DropConnect"
☆30Dec 4, 2015Updated 10 years ago
abhshkdz / neural-vqa-attention
View on GitHub
Attention-based Visual Question Answering in Torch
☆101Aug 13, 2017Updated 8 years ago
jnhwkim / nips-mrn-vqa
View on GitHub
Multimodal Residual Learning for Visual QA (NIPS 2016)
☆39Dec 27, 2016Updated 9 years ago
ntusteeian / VQA_CNN-LSTM
View on GitHub
Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…
☆23Jul 30, 2020Updated 5 years ago
kevjshih / wtl_vqa
View on GitHub
Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)
☆10Apr 8, 2020Updated 6 years ago
imatge-upc / vqa-2016-cvprw
View on GitHub
Visual question answering for CVPR16 VQA Challenge.
☆41Nov 5, 2016Updated 9 years ago