showkeyjar / chinese_im2text.pytorch
PyTorch implementation of Chinese image captioning on AI_challenger dataset
☆34Dec 25, 2019Updated 6 years ago
Alternatives and similar repositories for chinese_im2text.pytorch
Users that are interested in chinese_im2text.pytorch are comparing it to the libraries listed below
- AI Challenger Image Caption Competition☆10Dec 13, 2017Updated 8 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆51Dec 18, 2019Updated 6 years ago
- Chinese image captioning implemented with deep learning☆28Nov 23, 2018Updated 7 years ago
- ☆37Jan 5, 2018Updated 8 years ago
- Code for AI Challenger contest. (Generating Chinese image captions)☆216Oct 19, 2018Updated 7 years ago
- AI CHALLENGER Global AI Challenge: Chinese image captioning☆18Jan 19, 2018Updated 8 years ago
- Repository for image caption for Chinese☆28Dec 3, 2017Updated 8 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆339May 2, 2021Updated 4 years ago
- ☆10Apr 20, 2018Updated 7 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.☆12Nov 15, 2021Updated 4 years ago
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'☆10Jan 20, 2020Updated 6 years ago
- Pytorch implementation of TSE attention☆16Jul 9, 2021Updated 4 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆13Sep 24, 2017Updated 8 years ago
- Chinese image captioning + visual attention☆192Jan 9, 2020Updated 6 years ago
- Face completion using Generative Adversarial Networks☆11Sep 21, 2017Updated 8 years ago
- Dynamic Early Exit for Image Captioning☆17Oct 25, 2022Updated 3 years ago
- A PyTorch image deep style transfer library. It provides implementations of current SOTA algorithms, including AdaIN, WCT, LinearStyleTra…☆13Apr 15, 2020Updated 5 years ago
- Image Caption workout with NIC and NBT☆15Apr 5, 2019Updated 6 years ago
- Implementation of the Object Relation Transformer for Image Captioning☆180Sep 17, 2024Updated last year
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes☆16Nov 2, 2021Updated 4 years ago
- Implementation of paper "Improving Image Captioning with Better Use of Caption"☆33Sep 15, 2020Updated 5 years ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆24Jul 5, 2022Updated 3 years ago
- Code for Unsupervised Image Captioning☆221Mar 24, 2023Updated 2 years ago
- A lightweight framework using binary hash codes and deep learning for fast image retrieval.☆22Jun 20, 2017Updated 8 years ago
- Self-Erasing Network for Integral Object Attention☆54Nov 27, 2018Updated 7 years ago
- ☆23Aug 18, 2018Updated 7 years ago
- Cross-lingual image captioning☆91May 9, 2022Updated 3 years ago
- Chinese text generation; the news and prose models and code are now open source☆24Jun 12, 2023Updated 2 years ago
- A neural network architecture for realtime simultaneous face detection, landmark localization, pose estimation and gender recognition.☆22Oct 3, 2017Updated 8 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- Repository for an end-to-end image captioning method PTSN (ACM MM22).☆60Dec 11, 2022Updated 3 years ago
- Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]☆273Jul 27, 2021Updated 4 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- Code for paper "Image Captioning with End-to-End Attribute Detection and Subsequent Attributes Prediction". IEEE Transactions on Image Pr…☆26Mar 24, 2021Updated 4 years ago
- Implement-of-SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Caption☆54Jul 5, 2019Updated 6 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Nov 24, 2018Updated 7 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Apr 18, 2018Updated 7 years ago
- Multimodal deep quality embedding network (MMDQEN) for affective video content analysis. (MM'19, TAFFC'20)☆10Jul 24, 2021Updated 4 years ago
- Word2VisualVec: Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago