Re-implement CVPR2017 paper: "dense captioning with joint inference and visual context" and minor changes in Tensorflow. (mAP 8.296 after 500k iters of training)
☆60Feb 21, 2019Updated 7 years ago
Alternatives and similar repositories for densecap-tensorflow
Users that are interested in densecap-tensorflow are comparing it to the libraries listed below
Sorting:
- Dense captioning with joint inference and visual context☆53Dec 25, 2018Updated 7 years ago
- Implementation of CVPR2017 paper "A Hierarchical Approach for Generating Descriptive Image Paragraphs" in Tensorflow (in progress...)☆13Jan 27, 2018Updated 8 years ago
- Tensorflow implement of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs☆49Jul 31, 2018Updated 7 years ago
- Dense image captioning in Torch☆1,599Jul 31, 2018Updated 7 years ago
- A simple web app for visualizing the network structure of convolutional neural networks☆12Dec 27, 2017Updated 8 years ago
- A simplified pytorch version of densecap☆42Dec 11, 2024Updated last year
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…☆10Jul 19, 2017Updated 8 years ago
- Code of Dense Relational Captioning☆69Feb 23, 2023Updated 3 years ago
- ☆12Nov 29, 2017Updated 8 years ago
- Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)☆13Apr 6, 2019Updated 6 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆91Sep 12, 2019Updated 6 years ago
- Implementation of paper "Improving Image Captioning with Better Use of Caption"☆33Sep 15, 2020Updated 5 years ago
- VQA baseline with Conditional Batch Normalization☆15Apr 9, 2018Updated 7 years ago
- Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017☆149Jan 8, 2019Updated 7 years ago
- package for dense color histogram and dense SIFT feature extraction☆16Jul 14, 2014Updated 11 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆114Mar 24, 2022Updated 3 years ago
- The project is about predicting sets (of classes) from images.☆23Aug 31, 2021Updated 4 years ago
- This is our PyTorch implementation of Multi-level Scene Description Network (MSDN) proposed in our ICCV 2017 paper.☆230Nov 19, 2019Updated 6 years ago
- PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"☆25Apr 8, 2023Updated 2 years ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019☆21Feb 18, 2020Updated 6 years ago
- ☆20Oct 21, 2022Updated 3 years ago
- Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.☆1,007Oct 5, 2023Updated 2 years ago
- Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)☆123Aug 17, 2022Updated 3 years ago
- A simple script for generating Pascal VOC devkit-style annotations for the WIDER faces dataset☆21Dec 14, 2017Updated 8 years ago
- Towards Diverse and Natural Image Descriptions via a Conditional GAN☆75Dec 2, 2017Updated 8 years ago
- State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classif…☆49Feb 17, 2019Updated 7 years ago
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation"☆19Aug 5, 2021Updated 4 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Jun 13, 2023Updated 2 years ago
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- ROIPooling for pytorch☆51Apr 8, 2019Updated 6 years ago
- Using scene-specific contexts and region-based attention in neural image captioning☆45Apr 8, 2020Updated 5 years ago
- Factorizable Net (Multi-GPU version): An Efficient Subgraph-based Framework for Scene Graph Generation☆221Jul 25, 2019Updated 6 years ago
- ☆21Jul 25, 2024Updated last year
- ☆218Feb 26, 2022Updated 4 years ago
- The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition"☆26Aug 8, 2017Updated 8 years ago
- Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)☆544Aug 9, 2019Updated 6 years ago
- Meshed-Memory Transformer for Image Captioning. CVPR 2020☆545Dec 21, 2022Updated 3 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Nov 24, 2018Updated 7 years ago