Train a deeper LSTM and normalized CNN Visual Question Answering model. The current code reaches 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.
☆389 Mar 22, 2019 Updated 6 years ago
Alternatives and similar repositories for VQA_LSTM_CNN
Users who are interested in VQA_LSTM_CNN compare it to the repositories listed below.
- ☆351 Oct 2, 2018 Updated 7 years ago
- Simple Baseline for Visual Question Answering ☆187 Dec 21, 2016 Updated 9 years ago
- DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction ☆96 Apr 20, 2016 Updated 9 years ago
- [Reimplementation Antol et al 2015] Keras-based LSTM/CNN models for Visual Question Answering ☆480 Jun 11, 2018 Updated 7 years ago
- ☆218 Aug 13, 2016 Updated 9 years ago
- ☆390 Mar 11, 2021 Updated 4 years ago
- Tensorflow Implementation of Deeper LSTM + normalized CNN for Visual Question Answering ☆99 Apr 27, 2017 Updated 8 years ago
- Code for paper "Exploring Models and Data for Image Question Answering" ☆81 Mar 23, 2016 Updated 9 years ago
- Torch7 implementation of Grid LSTM as described here: http://arxiv.org/pdf/1507.01526v2.pdf ☆186 Feb 10, 2016 Updated 10 years ago
- Code for Stacked attention networks for image question answering ☆108 Jan 7, 2017 Updated 9 years ago
- Visual Q&A reading list ☆440 Oct 7, 2018 Updated 7 years ago
- Torch implementation for Stacked Attention Networks ☆23 Nov 24, 2016 Updated 9 years ago
- Neural module networks ☆401 Jul 7, 2017 Updated 8 years ago
- Tutorial for Visual Turing Test (visual question answering, image question answering). ☆118 Jan 30, 2017 Updated 9 years ago
- Visual Question Answering in Pytorch ☆734 Dec 11, 2019 Updated 6 years ago
- Faster-RCNN based on Densecap (deprecated) ☆84 Sep 12, 2016 Updated 9 years ago
- Hadamard Product for Low-rank Bilinear Pooling ☆70 Nov 6, 2017 Updated 8 years ago
- ContextLocNet: Context-aware Deep Network Models for Weakly Supervised Localization (ECCV 2016) ☆87 Apr 6, 2017 Updated 8 years ago
- Deterministic Policy Gradient using torch7 ☆43 Jun 2, 2016 Updated 9 years ago
- Visual Question Answering task written in Keras that answers questions about images ☆156 May 10, 2019 Updated 6 years ago
- VLFeat (partial) FFI wrapper for Torch7 ☆12 Mar 23, 2016 Updated 9 years ago
- Torch7 implementation of "Regularization of Neural Networks using DropConnect" ☆30 Dec 4, 2015 Updated 10 years ago
- Unsupervised learning of visual concepts from video ☆56 May 5, 2016 Updated 9 years ago
- An OCR-system based on torch using the technique of LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm. ☆66 Oct 27, 2015 Updated 10 years ago
- Visual Question Answering Demo on pretrained model ☆247 Oct 31, 2025 Updated 4 months ago
- Deep Networks with Stochastic Depth ☆481 Aug 13, 2018 Updated 7 years ago
- An implementation of Color2Gray with convolutional neural networks ☆11 Dec 23, 2015 Updated 10 years ago
- Visual Question Answering in Torch ☆488 May 3, 2016 Updated 9 years ago
- Simple PuddleWorld DQN example using torch7 ☆29 Jun 16, 2016 Updated 9 years ago
- image encoder ☆13 Sep 19, 2016 Updated 9 years ago
- Traffic sign recognition with Torch ☆175 May 24, 2016 Updated 9 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge. ☆765 Mar 10, 2024 Updated last year
- Multimodal Compact Bilinear Pooling for Torch7 ☆69 Jan 2, 2017 Updated 9 years ago
- Torch implementation of DRAW: A Recurrent Neural Network For Image Generation ☆136 Oct 7, 2015 Updated 10 years ago
- Visual Question Answering Project with state of the art single Model performance. ☆131 Jun 18, 2018 Updated 7 years ago
- ☆33 May 17, 2016 Updated 9 years ago
- automatic video description generation with GPU training ☆256 Jan 12, 2020 Updated 6 years ago
- The second version of the interface for Abstract Scenes research project. ☆23 May 16, 2022 Updated 3 years ago
- ☆967 Sep 25, 2023 Updated 2 years ago