VedantYadav / VQA
VQA - Visual Question Answering
☆13Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for VQA
- Modular and Simple approach to VQA in Keras☆22Updated 7 years ago
- The implementation of Text-guided Attention Model for Image Captioning☆22Updated 7 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆90Updated 5 years ago
- Tensorflow implementation of C. Gan, Z. Gan, X. He, J. Gao, and L. Deng, “StyleNet: Generating Attractive Visual Captions with Styles”☆9Updated 6 years ago
- ☆20Updated 5 years ago
- Stacked attention network for answering open-ended questions about image☆12Updated 6 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Updated 6 years ago
- Chinese Visual Question Answering 中文看图问答☆47Updated 7 years ago
- This repository gives a GUI using PyQt4 for VQA demo using Keras Deep Learning Library. The VQA model is created using Pre-trained VGG-1…☆46Updated 3 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆37Updated 6 years ago
- PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind P…☆60Updated 6 years ago
- Image Captioning in Chinese using LSTM RNN with attention mechanism☆39Updated 6 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆57Updated 6 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆103Updated 5 years ago
- ☆28Updated 4 years ago
- ☆54Updated 4 years ago
- Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"☆80Updated 6 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions.☆13Updated 5 years ago
- Co-attending Regions and Detections for VQA.☆41Updated 6 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Updated 6 years ago
- The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition"☆27Updated 7 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆73Updated 4 years ago
- R-VQA: Visual Question Answering with Relation Facts☆19Updated 3 years ago
- ☆28Updated 6 years ago
- ☆22Updated 6 years ago
- Visual Question Answering Project with state of the art single Model performance.☆132Updated 6 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆69Updated 4 years ago
- implement n2nmn with pytorch☆19Updated 5 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆34Updated 4 years ago