InnerPeace-Wu / densecap-tensorflowLinks

Re-implement CVPR2017 paper: "dense captioning with joint inference and visual context" and minor changes in Tensorflow. (mAP 8.296 after 500k iters of training)

☆61

Alternatives and similar repositories for densecap-tensorflow

Users that are interested in densecap-tensorflow are comparing it to the libraries listed below

Sorting:

yufengm / Adaptive
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
☆107Updated 7 years ago
linjieyangsc / densecap
Dense captioning with joint inference and visual context
☆53Updated 6 years ago
aditya12agd5 / convcap
☆129Updated 6 years ago
peteanderson80 / Up-Down-Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention.
☆246Updated 2 years ago
Cyanogenoid / vqa-counting
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆206Updated 6 years ago
gujiuxiang / Stack-Captioning
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
☆62Updated 7 years ago
jiasenlu / AdaptiveAttention
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
☆337Updated 7 years ago
ruotianluo / DiscCaptioning
Code for Discriminability objective for training descriptive captions(CVPR 2018)
☆109Updated 5 years ago
rakshithShetty / captionGAN
Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"
☆66Updated 6 years ago
aimbrain / vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆149Updated 6 years ago
zhegan27 / SCN_for_video_captioning
Using Semantic Compositional Networks for Video Captioning
☆96Updated 6 years ago
yikang-li / MSDN
This is our PyTorch implementation of Multi-level Scene Description Network (MSDN) proposed in our ICCV 2017 paper.
☆228Updated 5 years ago
Wentong-DST / up-down-captioner
Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"
☆29Updated 6 years ago
ruotianluo / Transformer_Captioning
Use transformer for captioning
☆156Updated 6 years ago
s-gupta / visual-concepts
Code for detecting visual concepts in images.
☆150Updated 7 years ago
lukemelas / image-paragraph-captioning
[EMNLP 2018] Training for Diversity in Image Paragraph Captioning
☆89Updated 5 years ago
fengyang0317 / unsupervised_captioning
Code for Unsupervised Image Captioning
☆218Updated 2 years ago
richardaecn / cvpr18-caption-eval
Learning to Evaluate Image Captioning. CVPR 2018
☆83Updated 7 years ago
zhegan27 / Semantic_Compositional_Nets
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
☆68Updated 7 years ago
kacky24 / stylenet
A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"
☆62Updated 4 years ago
doubledaibo / gancaption_iccv2017
Towards Diverse and Natural Image Descriptions via a Conditional GAN
☆75Updated 7 years ago
peteanderson80 / coco-caption
Adds SPICE metric to coco-caption evaluation server codes
☆50Updated 2 years ago
poojahira / image-captioning-bottom-up-top-down
PyTorch implementation of Image captioning with Bottom-up, Top-down Attention
☆166Updated 6 years ago
njchoma / transformer_image_caption
Image Captioning based on Bottom-Up and Top-Down Attention model
☆102Updated 6 years ago
VisionLearningGroup / caption-guided-saliency
Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)
☆107Updated 7 years ago
tsenghungchen / show-adapt-and-tell
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
☆148Updated 6 years ago
Cadene / murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
☆195Updated 5 years ago
yuzcccc / vqa-mfb
☆182Updated 6 years ago
cswhjiang / Recurrent_Fusion_Network
Source code for "Recurrent Fusion Network for Image Captioning".
☆23Updated 6 years ago
JonghwanMun / TextguidedATT
The implementation of Text-guided Attention Model for Image Captioning
☆21Updated 7 years ago