Toolkit for Visual7W visual question answering dataset
☆81Oct 8, 2019Updated 6 years ago
Alternatives and similar repositories for visual7w-toolkit
Users that are interested in visual7w-toolkit are comparing it to the libraries listed below
Sorting:
- Visual7W visual question answering models☆64Oct 8, 2019Updated 6 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Multimodal Residual Learning for Visual QA (NIPS 2016)☆38Dec 27, 2016Updated 9 years ago
- Code for our paper "CliqueCNN: Deep Unsupervised Exemplar Learning" https://arxiv.org/abs/1608.08792☆22Nov 10, 2017Updated 8 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 8 years ago
- Simple Baseline for Visual Question Answering☆187Dec 21, 2016Updated 9 years ago
- Attention-based Visual Question Answering in Torch☆101Aug 13, 2017Updated 8 years ago
- Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…☆388Mar 22, 2019Updated 6 years ago
- Visual question answering for CVPR16 VQA Challenge.☆41Nov 5, 2016Updated 9 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …☆27Mar 10, 2022Updated 3 years ago
- ☆351Oct 2, 2018Updated 7 years ago
- Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"☆79Mar 22, 2018Updated 7 years ago
- ☆390Mar 11, 2021Updated 4 years ago
- Deterministic Policy Gradient using torch7☆43Jun 2, 2016Updated 9 years ago
- Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017☆67Sep 20, 2018Updated 7 years ago
- Hadamard Product for Low-rank Bilinear Pooling☆70Nov 6, 2017Updated 8 years ago
- code for Stacked attention networks for image question answering☆108Jan 7, 2017Updated 9 years ago
- Torch implementation of seq2seq machine translation with GRU RNN and attention☆77Dec 4, 2016Updated 9 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆69Jan 2, 2017Updated 9 years ago
- Implementation of the approach described in "Understanding deep features with computer-generated imagery" , M. Aubry and B. Russell, ICCV…☆21Aug 15, 2023Updated 2 years ago
- LSTM with associative memory cells (http://arxiv.org/abs/1602.03032)☆109May 1, 2016Updated 9 years ago
- Very Deep Pairwise Word Interaction Neural Networks for modeling textual similarity (He and Lin, NAACL/HLT 2016)☆18May 27, 2018Updated 7 years ago
- Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016☆112Jul 31, 2016Updated 9 years ago
- CVPR'17 Spotlight: What’s in a Question: Using Visual Questions as a Form of Supervision☆44Aug 31, 2018Updated 7 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Code for detecting visual concepts in images.☆150Feb 27, 2018Updated 8 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆163Feb 8, 2019Updated 7 years ago
- Faster-RCNN based on Densecap(deprecated)☆84Sep 12, 2016Updated 9 years ago
- Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization☆62Feb 12, 2019Updated 7 years ago
- A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data, TPAMI, http://arxiv.org/abs/1409.3970☆39Aug 26, 2015Updated 10 years ago
- Referring Expression Generation using Neural Networks☆22Dec 8, 2022Updated 3 years ago
- Sequenced Show, Attend, and Tell: Natural Language from Natural Images☆12Jun 15, 2016Updated 9 years ago
- Spectral LDA☆13Jun 22, 2018Updated 7 years ago
- Visual Bidirectional Kernelized Network for Visual Question Answering☆11Jul 17, 2017Updated 8 years ago
- For training very deep networks☆10Jun 12, 2017Updated 8 years ago
- Wide-residual network implementations. Best result for cifar10(97.12%), cifar100(84.12%), and other kaggle challenges☆37Jan 13, 2017Updated 9 years ago
- Code for replicating results in 'On Weight Initializations in Deep Neural Networks'☆10Apr 28, 2017Updated 8 years ago
- Stacked attention network for answering open-ended questions about image☆12May 31, 2018Updated 7 years ago
- Implementing FastSent in theano☆12May 2, 2016Updated 9 years ago