showkeyjar / chinese_im2text.pytorchLinks
PyTorch implementation of Chinese image captioning on AI_challenger dataset
☆34Updated 5 years ago
Alternatives and similar repositories for chinese_im2text.pytorch
Users that are interested in chinese_im2text.pytorch are comparing it to the libraries listed below
Sorting:
- code for fluency-guided cross-lingual image captioning☆33Updated 7 years ago
- ☆37Updated 7 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆104Updated 6 years ago
- Repository for image caption for Chinese☆28Updated 8 years ago
- 图像中文描述☆99Updated 7 years ago
- Code for AI Challenger contest. (Generating chinese image captions)☆216Updated 7 years ago
- Chinese Visual Question Answering 中文看图问答☆47Updated 8 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Updated 5 years ago
- Cross-lingual image captioning☆90Updated 3 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Updated 7 years ago
- Image Captioning in Chinese using LSTM RNN with attention mechanism☆38Updated 7 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Updated 7 years ago
- Ad-hoc Video Search☆28Updated 4 years ago
- CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present☆98Updated 6 years ago
- Position Focused Attention Network for Image-Text Matching☆69Updated 6 years ago
- 图像中文描述+视觉注意 力☆192Updated 5 years ago
- Code for GHA (ACCV2018)☆13Updated 7 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Updated 5 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆91Updated 6 years ago
- ☆93Updated 8 years ago
- The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral☆68Updated 6 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Updated 5 years ago
- Deep Cross-Modal Projection Learning for Image-Text Matching☆74Updated 5 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆51Updated 5 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆212Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Updated 7 years ago
- ☆22Updated 7 years ago
- Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval(CVPR2018)☆165Updated 7 years ago
- This is an implementation of image caption, based on two different papers. The two papers are: 1. Show and Tell: A Neural Image Caption G…☆30Updated 6 years ago
- Extension of hLSTMat☆19Updated 4 years ago