jmhessel/clipscore

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jmhessel/clipscore)

jmhessel / clipscore

CLIPScore EMNLP code

☆251

Alternatives and similar repositories for clipscore

Users that are interested in clipscore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aimagelab / pacscore
View on GitHub
[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
☆66Jul 29, 2025Updated 11 months ago
DavidMChan / clair
View on GitHub
CLAIR: A (surprisingly) simple semantic text metric with large language models.
☆22Jan 28, 2024Updated 2 years ago
jungokasai / THumB
View on GitHub
☆15Apr 8, 2022Updated 4 years ago
HAWLYQ / InfoMetIC
View on GitHub
☆13Sep 5, 2023Updated 2 years ago
hwanheelee1993 / ViLBERTScore
View on GitHub
Code for ViLBERTScore in EMNLP Eval4NLP
☆18Oct 27, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
evanmiltenburg / MeasureDiversity
View on GitHub
Measure the diversity of image descriptions, repository for our COLING 2018 paper.
☆13Dec 29, 2019Updated 6 years ago
MILVLG / mt-captioning
View on GitHub
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Sep 4, 2020Updated 5 years ago
quangvnai / grit
View on GitHub
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199May 9, 2023Updated 3 years ago
jmhessel / pycocoevalcap
View on GitHub
Python 3 support for the MS COCO caption evaluation tools
☆14Jun 14, 2024Updated 2 years ago
joeyz0z / ConZIC
View on GitHub
Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
☆76Sep 20, 2023Updated 2 years ago
aimagelab / awesome-captioning-evaluation
View on GitHub
[IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
☆36Nov 25, 2025Updated 7 months ago
DavidHuji / CapDec
View on GitHub
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
☆209Jan 28, 2024Updated 2 years ago
vinid / neg_clip
View on GitHub
NegCLIP.
☆41Feb 6, 2023Updated 3 years ago
zmykevin / UVLP
View on GitHub
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆21Apr 15, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
j-min / CLIP-Caption-Reward
View on GitHub
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
☆246Jun 10, 2025Updated last year
tgxs002 / HPSv2
View on GitHub
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆677May 24, 2024Updated 2 years ago
ShiYaya / emscore
View on GitHub
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Oct 20, 2022Updated 3 years ago
agneet42 / revision
View on GitHub
[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
☆13Aug 6, 2024Updated last year
rmokady / CLIP_prefix_caption
View on GitHub
Simple image captioning model
☆1,421Jun 9, 2024Updated 2 years ago
LisaAnne / Hallucination
View on GitHub
☆97Mar 29, 2019Updated 7 years ago
RyanLiut / awesome-diverse-captioning
View on GitHub
Some papers about *diverse* image (a few videos) captioning
☆25Apr 4, 2023Updated 3 years ago
Seth-Park / comp-t2i-dataset
View on GitHub
Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)
☆45May 3, 2022Updated 4 years ago
baaaad / ECE
View on GitHub
[ECCV'22 Poster] Explicit Image Caption Editing
☆22Nov 30, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Georgelingzj / up-to-date-Vision-Language-Models
View on GitHub
Up-to-date Vision Language Models collection. Mainly focus on computer vision
☆20Feb 9, 2023Updated 3 years ago
dhg-wei / DeCap
View on GitHub
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆144Mar 16, 2023Updated 3 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
linzhiqiu / CLIP-FlanT5
View on GitHub
Training code for CLIP-FlanT5
☆31Jul 29, 2024Updated last year
liujch1998 / vera
View on GitHub
☆17May 23, 2023Updated 3 years ago
drboog / Lafite
View on GitHub
Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)
☆184Mar 23, 2023Updated 3 years ago
Yushi-Hu / tifa
View on GitHub
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
☆186Apr 29, 2024Updated 2 years ago
YuigaWada / Polos
View on GitHub
[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
☆33Jun 12, 2026Updated last month
microsoft / SwinBERT
View on GitHub
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆250May 26, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TobiasLee / VEC
View on GitHub
Visual and Embodied Concepts evaluation benchmark
☆21Oct 10, 2023Updated 2 years ago
tylin / coco-caption
View on GitHub
☆1,225May 13, 2024Updated 2 years ago
ezeli / Transformer_model
View on GitHub
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
☆12Nov 15, 2021Updated 4 years ago
xfactlab / I0T
View on GitHub
[ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap
☆12Jun 18, 2025Updated last year
microsoft / RegionCLIP
View on GitHub
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
☆817Mar 20, 2024Updated 2 years ago
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆346May 7, 2026Updated 2 months ago
pzzhang / VinVL
View on GitHub
project page for VinVL
☆360Jul 26, 2023Updated 2 years ago