yekeren / ADVISE-Image_ads_understanding
ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and External Knowledge for Decoding Advertisements".
☆26 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for ADVISE-Image_ads_understanding
- LSTM/BOF model to encode videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements". ☆14 · Updated 4 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be… ☆20 · Updated 6 years ago
- 👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP) ☆9 · Updated 4 years ago
- Hierarchical convolutional attention networks for text classification ☆16 · Updated 5 years ago
- Implementation of "MULE: Multimodal Universal Language Embedding" ☆15 · Updated 4 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention". ☆15 · Updated 3 years ago
- Generate a denotation graph from a set of image captions ☆15 · Updated 6 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018) ☆37 · Updated 6 years ago
- For visual commonsense model ☆34 · Updated 5 years ago
- Deep Learning for Video Retrieval by Natural Language ☆11 · Updated 5 years ago
- Implement n2nmn with PyTorch ☆19 · Updated 5 years ago
- A list of recent papers on transfer learning ☆24 · Updated 6 years ago
- ☆20 · Updated 6 years ago
- An attempt at a PyTorch implementation of "Zero-Shot" Super-Resolution using Deep Internal Learning by Shocher et al. (CVPR 2018) ☆13 · Updated 6 years ago
- https://arxiv.org/abs/1707.00836 ☆22 · Updated 7 years ago
- Models and code for the paper "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions" ☆15 · Updated 6 years ago
- ☆48 · Updated last year
- Attempts to understand deep learning and the TensorFlow RNN API by implementing a (very) crude version of the Google DeViSE paper (2013). ☆7 · Updated 8 years ago
- List of papers that apply graph networks to NLP ☆13 · Updated 5 years ago
- Code for "Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents" ☆30 · Updated 4 years ago
- Referring expression comprehension on ReferIt (RefClef) ☆10 · Updated 7 years ago
- [CVPR 2019] PyTorch code for Audio Visual Scene-Aware Dialog ☆34 · Updated 3 years ago
- Fast-Slow Recurrent Neural Networks ☆14 · Updated 6 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. ☆13 · Updated 5 years ago
- Egocentric Video Description based on Temporally-Linked Sequences ☆11 · Updated 7 years ago
- Keras implementation of a Siamese Neural Network for Joint Multimodal Text-Image Embedding ☆32 · Updated 7 years ago
- Code for the paper "Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling" ☆7 · Updated 5 years ago
- Implementation of the model in the paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition" ☆27 · Updated 7 years ago