CLAIR: A (surprisingly) simple semantic text metric with large language models.
☆21Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for clair
Users that are interested in clair are comparing it to the libraries listed below
Sorting:
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Apr 28, 2025Updated 10 months ago
- ☆16May 23, 2023Updated 2 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Oct 13, 2023Updated 2 years ago
- A curated list of zero-shot captioning papers☆24Aug 26, 2023Updated 2 years ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated 10 months ago
- NegCLIP.☆39Feb 6, 2023Updated 3 years ago
- CLIPScore EMNLP code☆245Dec 16, 2022Updated 3 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models☆14Nov 10, 2025Updated 3 months ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- Jupyter Notebooks from book UNDERSTANDING DEEP LEARNING (Prof Simon Prince) that I could solve.☆12Mar 20, 2024Updated last year
- ☆46Oct 27, 2023Updated 2 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 2 years ago
- ☆11May 17, 2024Updated last year
- ☆11Sep 8, 2024Updated last year
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- ☆10Mar 30, 2023Updated 2 years ago
- Self-Supervised Learning with Multi-View Rendering for 3D Point Cloud Analysis (ACCV 2022)☆10Jul 22, 2024Updated last year
- The Easiest Way to Run Commands as Systemd Services☆10Aug 27, 2025Updated 6 months ago
- ☆10Jul 5, 2024Updated last year
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 4 months ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆55Aug 16, 2024Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated 3 weeks ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources☆271Feb 8, 2026Updated 2 weeks ago
- ☆11Oct 2, 2024Updated last year
- ☆11Sep 15, 2023Updated 2 years ago
- final-project-level3-nlp-02 created by GitHub Classroom☆11Dec 31, 2021Updated 4 years ago
- [NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"☆17Dec 4, 2024Updated last year
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆57Jun 1, 2025Updated 8 months ago
- ☆14Nov 25, 2025Updated 3 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆15Jun 3, 2025Updated 8 months ago
- ☆25Nov 22, 2024Updated last year
- Official code of *Towards Event-oriented Long Video Understanding*☆12Jul 26, 2024Updated last year
- Convert pdf to pages of images☆13Apr 18, 2020Updated 5 years ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Jun 18, 2025Updated 8 months ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year
- The Official Code Repo for EgoOrientBench [CVPR25]☆14Nov 24, 2025Updated 3 months ago
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 3 months ago