nocaps: novel object captioning at scale
☆10May 23, 2019Updated 6 years ago
Alternatives and similar repositories for nocaps
Users that are interested in nocaps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- natural annotated text-category pairs for text classification☆10Sep 10, 2021Updated 4 years ago
- Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin☆12Mar 15, 2019Updated 7 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Dec 26, 2016Updated 9 years ago
- ☆23Aug 18, 2018Updated 7 years ago
- ☆22Oct 9, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Awesome-Adversarial-Attack-Methods-Summary☆13Jul 24, 2024Updated last year
- Rethinking the Form of Latent States in Image Captioning☆20Aug 31, 2018Updated 7 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆50Dec 18, 2019Updated 6 years ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…☆46Jul 27, 2019Updated 6 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆60Apr 5, 2018Updated 8 years ago
- A python3 library for evaluating caption's BLEU, Meteor, CIDEr, SPICE,ROUGE_L,WMD score. Fork from https://github.com/ruotianluo/coco-cap…☆22Nov 25, 2020Updated 5 years ago
- Code for "Deconvolution-Based Global Decoding for Neural Machine Translation" (COLING 2018).☆26Nov 22, 2018Updated 7 years ago
- ☆14Jun 5, 2020Updated 5 years ago
- Show-and-Fool: Adversarial Examples for Image Captioning task☆56Jul 6, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Dec 28, 2018Updated 7 years ago
- Universal Adversarial Perturbations for Vision-Language Pre-trained Models☆24Aug 8, 2025Updated 9 months ago
- ☆30Oct 2, 2018Updated 7 years ago
- ☆10May 4, 2018Updated 8 years ago
- ☆21Jan 15, 2024Updated 2 years ago
- image caption with semantic attention☆11Apr 1, 2017Updated 9 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 6 months ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of CVPR 2016 paper☆74Jan 31, 2021Updated 5 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 11 months ago
- [ICCV-2025] Universal Adversarial Attack, Multimodal Adversarial Attacks, VLP models, Contrastive Learning, Cross-modal Perturbation Gene…☆36Jul 10, 2025Updated 9 months ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- neural baby talk reimplementation with python3☆16May 2, 2019Updated 7 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- ☆10Apr 20, 2018Updated 8 years ago
- ☆10May 10, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multimodal deep quality embedding network (MMDQEN) for affective video content analysis. (MM'19, TAFFC'20)☆10Jul 24, 2021Updated 4 years ago
- ☆12Dec 9, 2018Updated 7 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Sep 5, 2018Updated 7 years ago
- Official repository for the AAAI2026 paper (Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery …☆27Apr 24, 2026Updated 2 weeks ago
- A series of models applying memory augmented neural networks to machine translation☆15May 3, 2018Updated 8 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 8 years ago
- pytorch code for recurring paper:Zero-Shot Detection【https://arxiv.org/abs/1803.07113】☆18Jan 16, 2019Updated 7 years ago