entalent / coco-caption-py3

☆12

Related projects: ⓘ

LibertFan / ImageCaption
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Updated 5 years ago
andyweizhao / Multitask_Image_Captioning
☆22Updated 6 years ago
chrisc36 / bottom-up-attention-vqa
BottomUpTopDown VQA model with question-type debiasing
☆23Updated 4 years ago
JXZe / DualVD
☆76Updated last year
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆49Updated 4 years ago
fenglinliu98 / MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
☆64Updated 3 years ago
gujiuxiang / unpaired_image_captioning
Unpaired Image Captioning
☆35Updated 3 years ago
qingzwang / DiversityMetrics
This is the implementation of self-CIDEr and LSA-based diversity metrics (only for python 2.7).
☆35Updated 2 years ago
yiyang92 / caption-stylenet_tensorflow
Tensorflow implementation of C. Gan, Z. Gan, X. He, J. Gao, and L. Deng, “StyleNet: Generating Attractive Visual Captions with Styles”
☆9Updated 5 years ago
eric-xw / Video-guided-Machine-Translation
Starter code for the VMT task and challenge
☆50Updated 4 years ago
bupt-cist / DFAF-for-VQA.pytorch
☆47Updated this week
ck0123 / improved-bertscore-for-image-captioning-evaluation
☆22Updated last month
asdf0982 / vqa-mfb.pytorch
This project is out of date, I don't remember the details inside...
☆85Updated 6 years ago
YuanEZhou / Grounded-Image-Captioning
☆62Updated 2 years ago
daqingliu / CAVP
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…
☆47Updated 5 years ago
cswhjiang / Recurrent_Fusion_Network
Source code for "Recurrent Fusion Network for Image Captioning".
☆23Updated 5 years ago
fawazsammani / show-edit-tell
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
☆81Updated 4 years ago
yanxinzju / CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
☆76Updated last year
entalent / MemCap
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Updated 4 years ago
gujiuxiang / Stack-Captioning
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
☆63Updated 6 years ago
HaoYang0123 / Position-Focused-Attention-Network
Position Focused Attention Network for Image-Text Matching
☆66Updated 5 years ago
jamespark3922 / adv-inf
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34Updated 5 years ago
astro-zihao / mucko
☆14Updated this week
Deanplayerljx / tab-vcr
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19Updated 3 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34Updated 4 years ago
HLR / Cross_Modality_Relevance
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆26Updated 3 years ago
SeleenaJM / CapEval
An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)
☆33Updated 4 years ago
mrsalehi / ground-sentence-video
Implementation of the EMNLP 2018 paper "Temporally Grounding Natural Sentence in Video" using PyTorch
☆2Updated last year
erobic / negative_analysis_of_grounding
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23Updated 4 years ago