☆22Oct 9, 2021Updated 4 years ago
Alternatives and similar repositories for Human-Attention-in-Image-Captioning
Users that are interested in Human-Attention-in-Image-Captioning are comparing it to the libraries listed below
Sorting:
- Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin☆12Mar 15, 2019Updated 6 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 9 months ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…☆46Jul 27, 2019Updated 6 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Jan 17, 2019Updated 7 years ago
- Rethinking the Form of Latent States in Image Captioning☆20Aug 31, 2018Updated 7 years ago
- nocaps: novel object captioning at scale☆10May 23, 2019Updated 6 years ago
- Code of Dense Relational Captioning☆69Feb 23, 2023Updated 3 years ago
- ☆10May 10, 2019Updated 6 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- ☆30Oct 2, 2018Updated 7 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019☆282Dec 21, 2022Updated 3 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆60Apr 5, 2018Updated 7 years ago
- ☆15Nov 23, 2020Updated 5 years ago
- PathGan: Visual Scan-path Prediction with Generative Adversarial Networks☆42Mar 24, 2023Updated 2 years ago
- SALICON API☆31Mar 30, 2017Updated 8 years ago
- A dilated inception network for visual saliency prediction (TMM 2019)☆35Jul 19, 2023Updated 2 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Sep 5, 2018Updated 7 years ago
- Feature extraction and visualization scripts for nocaps baselines.☆18Jan 22, 2021Updated 5 years ago
- Code for Unsupervised Image Captioning☆222Mar 24, 2023Updated 2 years ago
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"☆43May 13, 2021Updated 4 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Dec 26, 2016Updated 9 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 8 years ago
- Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"☆50Jun 29, 2021Updated 4 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆91Sep 12, 2019Updated 6 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)☆107Jan 22, 2018Updated 8 years ago
- Using scene-specific contexts and region-based attention in neural image captioning☆45Apr 8, 2020Updated 5 years ago
- Code for "Deconvolution-Based Global Decoding for Neural Machine Translation" (COLING 2018).☆26Nov 22, 2018Updated 7 years ago
- 🖼️ Attend to You: Personalized Image Captioning with Context Sequence Memory Networks. In CVPR, 2017. Expanded : Towards Personalized Im…☆207Jan 10, 2021Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 201…☆50Jan 28, 2020Updated 6 years ago
- 生态模拟仿真系统☆10Aug 5, 2020Updated 5 years ago
- CNN-based full-reference image quality assessment☆65Dec 7, 2017Updated 8 years ago
- domain transform filter for opencv☆30Aug 22, 2016Updated 9 years ago
- Show-and-Fool: Adversarial Examples for Image Captioning task☆56Jul 6, 2021Updated 4 years ago
- ☆26Nov 30, 2019Updated 6 years ago
- Simple vs complex temporal recurrences for video saliency prediction (BMVC 2019)☆26Nov 22, 2022Updated 3 years ago
- ☆22Mar 11, 2016Updated 9 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Apr 18, 2018Updated 7 years ago