foamliu / Image-Captioning-PyTorch
Chinese image captioning + visual attention
☆182Updated 4 years ago
Related projects:
- ☆159Updated this week
- Chinese image captioning☆94Updated 6 years ago
- Cross-lingual image captioning☆82Updated 2 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Updated 4 years ago
- ClipCap-based image captioning model☆271Updated 2 years ago
- Image Caption workout with NIC and NBT☆15Updated 5 years ago
- code for fluency-guided cross-lingual image captioning☆32Updated 6 years ago
- Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]☆269Updated 3 years ago
- Text summarization (captioning) of video: takes a video as input and uses deep learning networks to identify its main meaning (input a video, output a text describing the video).☆179Updated 6 years ago
- [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”☆214Updated 5 months ago
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆193Updated 2 years ago
- Chinese image captioning implemented with deep learning☆24Updated 5 years ago
- Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks☆178Updated last year
- Image Captioning based on Bottom-Up and Top-Down Attention model☆104Updated 5 years ago
- Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …☆200Updated last year
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆325Updated 3 years ago
- Code for AI Challenger contest. (Generating Chinese image captions)☆213Updated 5 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention☆160Updated 5 years ago
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes☆13Updated 2 years ago
- PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning☆82Updated 4 years ago
- This is an implementation of the paper "Show and Tell: A Neural Image Caption Generator".☆19Updated 5 years ago
- Bridging Vision and Language Model☆279Updated last year
- A dataset of 500,000 multimodal short videos and baseline models (TensorFlow 2.0).☆127Updated 5 years ago
- VQA-tf2☆12Updated 3 years ago
- Image Caption with Attention | a PyTorch Project to Image Caption☆16Updated 5 years ago
- Repository for Chinese image captioning☆27Updated 6 years ago
- code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"☆90Updated 4 years ago
- Image Captioning in Chinese using LSTM RNN with attention mechanism☆39Updated 5 years ago
- A bilingual dataset for image captioning☆17Updated 3 years ago
- Image captioning code in Keras, runs on GPU.☆23Updated 4 years ago