Dense captioning with joint inference and visual context
☆53 · Dec 25, 2018 · Updated 7 years ago
Alternatives and similar repositories for densecap
Users who are interested in densecap are comparing it to the repositories listed below
- PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021) ☆17 · Jan 12, 2023 · Updated 3 years ago
- Implementation of CVPR2017 paper "A Hierarchical Approach for Generating Descriptive Image Paragraphs" in TensorFlow (in progress...) ☆13 · Jan 27, 2018 · Updated 8 years ago
- Contrastive Learning for Image Captioning ☆51 · Feb 22, 2018 · Updated 8 years ago
- Automatic image captioning model based on Caffe, using features from bottom-up attention. ☆249 · Feb 3, 2023 · Updated 3 years ago
- Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018) ☆13 · Apr 6, 2019 · Updated 6 years ago
- Source code for "Recurrent Fusion Network for Image Captioning". ☆23 · Nov 24, 2018 · Updated 7 years ago
- Dense image captioning in Torch ☆1,599 · Jul 31, 2018 · Updated 7 years ago
- Code of Dense Relational Captioning ☆69 · Feb 23, 2023 · Updated 3 years ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019 ☆21 · Feb 18, 2020 · Updated 6 years ago
- Implementation of paper "Improving Image Captioning with Better Use of Caption" ☆33 · Sep 15, 2020 · Updated 5 years ago
- This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 201… ☆50 · Jan 28, 2020 · Updated 6 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning ☆63 · Apr 18, 2018 · Updated 7 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA ☆24 · Oct 15, 2020 · Updated 5 years ago
- Placeholder for code of BSP. ☆11 · Aug 13, 2021 · Updated 4 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 · Sep 17, 2021 · Updated 4 years ago
- PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021) ☆56 · Feb 6, 2023 · Updated 3 years ago
- This is our PyTorch implementation of Multi-level Scene Description Network (MSDN) proposed in our ICCV 2017 paper. ☆230 · Nov 19, 2019 · Updated 6 years ago
- The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition" ☆26 · Aug 8, 2017 · Updated 8 years ago
- ☆14 · Aug 5, 2018 · Updated 7 years ago
- The imdb files with SBD-Trans OCR for TextVQA dataset. ☆11 · Nov 30, 2021 · Updated 4 years ago
- ☆12 · Mar 8, 2021 · Updated 4 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019) ☆11 · Sep 30, 2019 · Updated 6 years ago
- ☆30 · Feb 19, 2018 · Updated 8 years ago
- Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018) ☆544 · Aug 9, 2019 · Updated 6 years ago
- A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations). ☆592 · Jan 23, 2024 · Updated 2 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention". ☆15 · Jun 22, 2021 · Updated 4 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language" ☆16 · Oct 22, 2022 · Updated 3 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence" ☆12 · Apr 14, 2021 · Updated 4 years ago
- Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning" ☆339 · Dec 26, 2017 · Updated 8 years ago
- Using scene-specific contexts and region-based attention in neural image captioning ☆45 · Apr 8, 2020 · Updated 5 years ago
- Data Release for VALUE Benchmark ☆30 · Feb 16, 2022 · Updated 4 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention ☆168 · Jan 6, 2019 · Updated 7 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision… ☆37 · Apr 25, 2021 · Updated 4 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning ☆171 · Dec 4, 2020 · Updated 5 years ago
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning ☆15 · Dec 12, 2023 · Updated 2 years ago
- Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020] ☆273 · Jul 27, 2021 · Updated 4 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning ☆91 · Sep 12, 2019 · Updated 6 years ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021 ☆18 · Oct 24, 2021 · Updated 4 years ago
- A curated list of image captioning and related area resources. :-) ☆1,074 · Mar 28, 2023 · Updated 2 years ago