☆22 · Oct 9, 2021 · Updated 4 years ago
Alternatives and similar repositories for Human-Attention-in-Image-Captioning
Users that are interested in Human-Attention-in-Image-Captioning are comparing it to the libraries listed below.
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code… ☆13 · May 25, 2025 · Updated 11 months ago
- Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S. Fidler, D. Lin ☆12 · Mar 15, 2019 · Updated 7 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019) ☆34 · Jul 17, 2019 · Updated 6 years ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ… ☆46 · Jul 27, 2019 · Updated 6 years ago
- Rethinking the Form of Latent States in Image Captioning ☆20 · Aug 31, 2018 · Updated 7 years ago
- nocaps: novel object captioning at scale ☆10 · May 23, 2019 · Updated 6 years ago
- Code of Dense Relational Captioning ☆69 · Feb 23, 2023 · Updated 3 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019) ☆11 · Sep 30, 2019 · Updated 6 years ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI) ☆11 · Jan 17, 2019 · Updated 7 years ago
- ☆15 · Nov 23, 2020 · Updated 5 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019 ☆282 · Dec 21, 2022 · Updated 3 years ago
- ☆30 · Oct 2, 2018 · Updated 7 years ago
- Feature extraction and visualization scripts for nocaps baselines. ☆18 · Jan 22, 2021 · Updated 5 years ago
- A dilated inception network for visual saliency prediction (TMM 2019) ☆35 · Jul 19, 2023 · Updated 2 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space ☆60 · Apr 5, 2018 · Updated 8 years ago
- SALICON API ☆31 · Mar 30, 2017 · Updated 9 years ago
- Contrastive Learning for Image Captioning ☆51 · Feb 22, 2018 · Updated 8 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning" ☆21 · Dec 26, 2016 · Updated 9 years ago
- ☆22 · Mar 11, 2016 · Updated 10 years ago
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning" ☆43 · May 13, 2021 · Updated 4 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018) ☆36 · Sep 5, 2018 · Updated 7 years ago
- Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL 2021) ☆10 · Nov 15, 2021 · Updated 4 years ago
- CNN-based full-reference image quality assessment ☆65 · Dec 7, 2017 · Updated 8 years ago
- All NLP experiments described in ArXiv paper 1904.02682 ☆32 · Jun 24, 2019 · Updated 6 years ago
- Code for Unsupervised Image Captioning ☆223 · Mar 24, 2023 · Updated 3 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017) ☆107 · Jan 22, 2018 · Updated 8 years ago
- Re-implement the work from "Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework" ☆34 · Dec 3, 2018 · Updated 7 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.… ☆26 · Nov 3, 2018 · Updated 7 years ago
- Show-and-Fool: Adversarial Examples for Image Captioning task ☆56 · Jul 6, 2021 · Updated 4 years ago
- Code for "Deconvolution-Based Global Decoding for Neural Machine Translation" (COLING 2018).