lmelvix / visual-question-answering-tensorflow
Stacked attention network for answering open-ended questions about images
☆12 Updated 6 years ago
Alternatives and similar repositories for visual-question-answering-tensorflow:
Users who are interested in visual-question-answering-tensorflow are comparing it to the libraries listed below.
- Co-attending Regions and Detections for VQA. ☆40 Updated 6 years ago
- Code for paper "Image Caption Generation with Text-Conditional Semantic Attention" ☆60 Updated 7 years ago
- Modular and simple approach to VQA in Keras ☆21 Updated 7 years ago
- Implementation of the Text-guided Attention Model for Image Captioning ☆21 Updated 7 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering ☆24 Updated 4 years ago
- Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR 2016) ☆10 Updated 5 years ago
- ☆20 Updated 5 years ago
- Structured Attentions for Visual Question Answering ☆46 Updated 7 years ago
- Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge. ☆19 Updated 6 years ago
- Torch implementation for Stacked Attention Networks ☆23 Updated 8 years ago
- N2NMN implemented in PyTorch ☆19 Updated 6 years ago
- ☆15 Updated 7 years ago
- Image captioning with semantic attention ☆11 Updated 8 years ago
- TensorFlow implementation of the paper: Optimization of image description metrics using policy gradient methods ☆29 Updated 6 years ago
- Visual Question Generation as Dual Task of Visual Question Answering (PyTorch Version) ☆81 Updated 6 years ago
- TensorFlow implementation of Dual Attention Network ☆20 Updated 7 years ago
- Memory-augmented Attention Modelling for Videos ☆9 Updated 8 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" ☆29 Updated 6 years ago
- TensorFlow implementation of Deeper LSTM + normalized CNN for Visual Question Answering ☆99 Updated 8 years ago
- TensorFlow implementation of Show, Attend and Tell (ICML'15) ☆19 Updated 7 years ago
- Code release for Hu et al., "Modeling Relationships in Referential Expressions with Compositional Modular Networks", CVPR 2017 ☆67 Updated 6 years ago
- Visual Question Answering project with state-of-the-art single-model performance. ☆131 Updated 6 years ago
- ☆41 Updated 8 years ago
- Implementation of the model in the paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition" ☆26 Updated 7 years ago
- Word2VisualVec: Predicting Visual Features from Text for Image and Video Caption Retrieval ☆69 Updated 5 years ago
- ☆30 Updated 6 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding ☆23 Updated 6 years ago
- CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present ☆98 Updated 6 years ago
- Hadamard Product for Low-rank Bilinear Pooling ☆70 Updated 7 years ago
- TensorFlow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering" ☆79 Updated 7 years ago