HAWLYQ / InfoMetIC
☆14Updated last year
Alternatives and similar repositories for InfoMetIC
Users who are interested in InfoMetIC are comparing it to the repositories listed below.
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆62Updated 2 weeks ago
- Repository for the paper "Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models"☆27Updated last year
- Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)☆25Updated 2 years ago
- ☆16Updated 3 years ago
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆43Updated 2 months ago
- Natural language guided image captioning☆84Updated last year
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆35Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆36Updated 11 months ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆198Updated last year
- Implementation of the paper https://arxiv.org/abs/2210.04559☆54Updated 2 years ago
- ☆24Updated last year
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆53Updated 8 months ago
- Bounding and Filling: A Fast and Flexible Framework for Image Captioning☆9Updated last year
- Python 3 support for the MS COCO caption evaluation tools☆14Updated last year
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆132Updated last year
- SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation☆117Updated last year
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆60Updated 2 years ago
- Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"☆74Updated last year
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m…☆64Updated last year
- ☆15Updated 3 years ago
- [ICLR 2023] This is the code repo for our ICLR'23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆51Updated last year
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆135Updated 2 years ago
- ☆37Updated last year
- Colorful Prompt Tuning for Pre-trained Vision-Language Models☆49Updated 2 years ago
- Official repository for the A-OKVQA dataset☆96Updated last year
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆54Updated last year
- The SVO-Probes Dataset for Verb Understanding☆31Updated 3 years ago
- Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos☆25Updated last year
- NegCLIP.☆34Updated 2 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Updated last year