siwooyong / Codalab-Microsoft-COCO-Image-Captioning-ChallengeLinks

🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21)

☆23

Alternatives and similar repositories for Codalab-Microsoft-COCO-Image-Captioning-Challenge

Users that are interested in Codalab-Microsoft-COCO-Image-Captioning-Challenge are comparing it to the libraries listed below

Sorting:

fenglinliu98 / MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
☆65Updated 4 years ago
Gitsamshi / WeakVRD-Captioning
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆32Updated 4 years ago
HLR / Cross_Modality_Relevance
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27Updated 4 years ago
YuanEZhou / Grounded-Image-Captioning
☆63Updated 3 years ago
YuanEZhou / satic
☆26Updated 4 years ago
qinzzz / Multimodal-Alignment-Framework
Implementation for MAF: Multimodal Alignment Framework
☆46Updated 4 years ago
fawazsammani / show-edit-tell
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
☆80Updated 4 years ago
zmykevin / UC2
CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34Updated 3 years ago
YuanEZhou / CBTrans
☆22Updated 3 years ago
bearcatt / LaBERT
A length-controllable and non-autoregressive image captioning model.
☆68Updated 4 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34Updated 5 years ago
MILVLG / rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
☆56Updated 2 years ago
Dong-JinKim / DenseRelationalCaptioning
Code of Dense Relational Captioning
☆69Updated 2 years ago
SeleenaJM / CapEval
An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)
☆38Updated 5 years ago
ruotianluo / coco-caption
☆67Updated 2 years ago
salesforce / VD-BERT
☆44Updated last week
NeverMoreLCH / Awesome-VQA
A reading list of papers about Visual Question Answering.
☆32Updated 2 years ago
gujiuxiang / unpaired_image_captioning
Unpaired Image Captioning
☆36Updated 4 years ago
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Updated 5 years ago
VALUE-Leaderboard / StarterCode
Starter Code for VALUE benchmark
☆80Updated 2 years ago
alasdairtran / transform-and-tell
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
☆91Updated last year
mad-red / VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Updated 3 years ago
ShiYaya / emscore
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Updated 2 years ago
jayleicn / VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆49Updated 2 years ago
microsoft / M3P
Multitask Multilingual Multimodal Pre-training
☆72Updated 2 years ago
sks3i / pycocoevalcap
Microsoft COCO Caption Evaluation Tool - Python 3
☆33Updated 6 years ago
VALUE-Leaderboard / DataRelease
Data Release for VALUE Benchmark
☆31Updated 3 years ago
zmykevin / UVLP
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆22Updated 3 years ago
daqingliu / coco-caption
A python3 version of coco-caption with spice.
☆19Updated 5 years ago
qingzwang / DiverseImageCaptioning
Code for "On diversity in image captioning: metrics and methods".
☆8Updated 4 years ago