stevehuanghe / image_captioning
Image captioning models in PyTorch
☆37Updated 3 years ago
Related projects: ⓘ
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Updated 6 years ago
- ☆130Updated 5 years ago
- This project is out of date, I don't remember the details inside...☆85Updated 6 years ago
- Image Captions Generation with Spatial and Channel-wise Attention☆208Updated 6 years ago
- Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning☆107Updated 6 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Updated 5 years ago
- Code for paper "Image Captioning with End-to-End Attribute Detection and Subsequent Attributes Prediction". IEEE Transactions on Image Pr…☆26Updated 3 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆104Updated 5 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Updated 5 years ago
- Code for Discriminability objective for training descriptive captions(CVPR 2018)☆110Updated 4 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Updated 5 years ago
- NBT with some changes to run smoothly with python3☆16Updated 5 years ago
- Bottom-up features extractor implemented in PyTorch.☆71Updated 4 years ago
- ☆83Updated 3 years ago
- Learning to Evaluate Image Captioning. CVPR 2018☆83Updated 6 years ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…☆47Updated 5 years ago
- ☆10Updated 6 years ago
- CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present☆98Updated 5 years ago
- fork from https://github.com/jwyang/faster-rcnn.pytorch☆10Updated 6 years ago
- ☆34Updated this week
- two models for visual relationship detection☆93Updated 5 years ago
- This is our PyTorch implementation of Multi-level Scene Description Network (MSDN) proposed in our ICCV 2017 paper.☆227Updated 4 years ago
- TensorFlow Implementation of Deep Cross-Modal Projection Learning☆94Updated 4 years ago
- Pytorch Implementation of Learning Deep Representations of Fine-grained Visual Descriptions☆14Updated 6 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Updated 4 years ago
- Pytorch implementation of "Learning Deep Structure-Preserving Image-Text Embeddings"☆37Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Updated 4 years ago
- Use transformer for captioning☆156Updated 5 years ago
- Code for Unsupervised Image Captioning☆215Updated last year
- Code for GHA (ACCV2018)☆13Updated 5 years ago