giangnguyen2412 / Show-Attend-and-Tell-Pytorch-Implementation
☆9Updated 5 years ago
Alternatives and similar repositories for Show-Attend-and-Tell-Pytorch-Implementation:
Users that are interested in Show-Attend-and-Tell-Pytorch-Implementation are comparing it to the libraries listed below
- The implementation of Text-guided Attention Model for Image Captioning☆21Updated 7 years ago
- Rethinking the Form of Latent States in Image Captioning☆21Updated 6 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Updated 6 years ago
- Co-attending Regions and Detections for VQA.☆40Updated 6 years ago
- neural baby talk reimplementation with python3☆16Updated 6 years ago
- Stacked attention network for answering open-ended questions about image☆12Updated 6 years ago
- Contrastive Learning for Image Captioning☆50Updated 7 years ago
- ☆20Updated 3 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago
- The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition"☆26Updated 7 years ago
- ☆30Updated 6 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Updated 3 years ago
- image caption with semantic attention☆11Updated 8 years ago
- BISON: Binary Image SelectiON☆49Updated 3 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 9 years ago
- Image Caption, Show and Tell.☆21Updated 7 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Updated 5 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆19Updated 9 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Updated 7 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆62Updated 7 years ago
- Attempts to understand deep learning and the Tensorflow RNN api by implementing a (very)crude version of the google DeViSE paper(2013).☆7Updated 8 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Updated 6 years ago
- Multi-Target Embodied Question Answering☆11Updated 6 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆50Updated 5 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆80Updated 4 years ago
- Code used by the paper "What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?".☆14Updated 7 years ago
- ☆71Updated 6 years ago
- Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension☆34Updated 7 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆58Updated 7 years ago