kenya-sk / show_attend_and_tell
This repository reimplements the "Show, Attend and Tell" model and adds extra deep learning techniques.
☆11Updated last year
Alternatives and similar repositories for show_attend_and_tell:
Users who are interested in show_attend_and_tell are comparing it to the libraries listed below:
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Updated 4 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆49Updated 5 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆22Updated 4 years ago
- Code for GHA (ACCV2018)☆13Updated 6 years ago
- Orderless Recurrent Models for Multi-label Classification☆43Updated 4 years ago
- PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning☆84Updated 4 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Updated 4 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆54Updated 2 years ago
- Implementation of paper "Improving Image Captioning with Better Use of Caption"☆32Updated 4 years ago
- KSSNet: Multi-Label Classification with Label Graph Superimposing☆59Updated 4 years ago
- video captioning☆24Updated 5 years ago
- Image Caption with Attention | a PyTorch Project to Image Caption☆17Updated 5 years ago
- The method of text-to-image☆48Updated 5 years ago
- ☆26Updated 3 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆37Updated last year
- Code for the ICMR 2019 short paper "Weakly Supervised Image Retrieval via Coarse-scale Feature Fusion and Multi-level Attention Blocks"☆31Updated 2 years ago
- Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆26Updated 5 years ago
- code for fluency-guided cross-lingual image captioning☆30Updated 6 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis☆13Updated 4 years ago
- Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019☆30Updated 3 years ago
- Starter code for the VMT task and challenge☆51Updated 4 years ago
- ☆20Updated 5 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆59Updated 4 years ago
- Deep Cross-Modal Projection Learning for Image-Text Matching☆74Updated 4 years ago
- TensorFlow Implementation of Deep Cross-Modal Projection Learning☆95Updated 5 years ago
- Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries, ECCV 2018☆75Updated 3 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆65Updated 5 years ago
- ☆19Updated 2 years ago
- This project is out of date, I don't remember the details inside...☆84Updated 7 years ago
- Implementations of Recent Papers in Computer Vision☆39Updated 2 years ago