Wentong-DST / self-critical
PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for self-critical
- Unpaired Image Captioning☆35Updated 3 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆64Updated 4 years ago
- ☆62Updated 2 years ago
- python codes for CIDEr - Consensus-based Image Caption Evaluation☆32Updated 5 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆65Updated last year
- This is the implementation of self-CIDEr and LSA-based diversity metrics (only for python 2.7).☆36Updated 2 years ago
- R-VQA: Visual Question Answering with Relation Facts☆19Updated 3 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆90Updated 5 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆34Updated 4 years ago
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"☆43Updated 3 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Updated last year
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆57Updated 6 years ago
- Contrastive Learning for Image Captioning☆51Updated 6 years ago
- ☆22Updated 6 years ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…☆47Updated 5 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Updated 5 years ago
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆40Updated 5 years ago
- Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019☆17Updated 5 years ago
- BottomUpTopDown VQA model with question-type debiasing☆23Updated 5 years ago
- This project is out of date, I don't remember the details inside...☆85Updated 6 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆76Updated last year
- Code for the model "Heterogeneous Graph Learning for Visual Commonsense Reasoning (NeurlPS 2019)"☆46Updated 4 years ago
- Information Maximizing Visual Question Generation☆66Updated last year
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Updated 4 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Updated 5 years ago
- ☆29Updated 6 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …☆21Updated 2 years ago
- ☆32Updated last year
- NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering☆59Updated 3 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Updated 4 years ago