dianaglzrico / neural-visual-storyteller
An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller," presented at the Storytelling Workshop, co-located with NAACL 2018.
☆33Updated 5 years ago
Related projects:
- Implementation of a seq2seq model for the Visual Storytelling Challenge (VIST): http://visionandlanguage.net/VIST/index.html ☆58Updated 6 years ago
- I2T2I: Text-to-Image Synthesis with textual data augmentation☆30Updated 5 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 5 years ago
- A PyTorch implementation of the paper Generative Adversarial Text-to-Image Synthesis☆25Updated 4 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Updated 3 years ago
- Implementation of "MULE: Multimodal Universal Language Embedding"☆15Updated 4 years ago
- An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions".☆33Updated 6 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions.☆13Updated 5 years ago
- Used LSTM on Flickr dataset☆11Updated 6 years ago
- [CVPR 2019] PyTorch code for Audio Visual Scene-Aware Dialog☆33Updated 3 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 3 years ago
- Modular and Simple approach to VQA in Keras☆22Updated 7 years ago
- Image Captioning: Implementing the Neural Image Caption Generator with Python☆63Updated 6 years ago
- GLAC Net: GLocal Attention Cascading Network for the Visual Storytelling Challenge☆43Updated 4 years ago
- Code for the AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆26Updated 3 years ago
- PyTorch implementation of the paper "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"☆26Updated 5 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Updated 5 years ago
- https://arxiv.org/abs/1707.00836☆22Updated 6 years ago
- This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…☆12Updated 3 years ago
- Attention-based Visual Question Answering in Torch☆100Updated 7 years ago
- Models and code for the paper "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions"☆15Updated 6 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆32Updated 5 years ago
- Here we describe a new approach to training a video captioning neural network that is not only based on the normal cross entropy loss for …☆8Updated 4 years ago
- Keras implementation of a Siamese Neural Network for Joint Multimodal Text-Image Embedding☆32Updated 6 years ago