naver-ai / eccv-caption

Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)

☆55

Alternatives and similar repositories for eccv-caption

Users who are interested in eccv-caption are comparing it to the repositories listed below

kakaobrain / noc
☆46 Updated last year
MIMICLab / BITTERS
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
☆21 Updated 2 years ago
NVlabs / PALAVRA
☆52 Updated 3 years ago
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆31 Updated 3 years ago
MIMICLab / L-Verse
L-Verse: Bidirectional Generation Between Image and Text
☆108 Updated 4 months ago
jaeseokbyun / GRIT-VLP
This is an official implementation of GRIT-VLP
☆21 Updated 2 years ago
naver-ai / pcme
Official PyTorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
☆133 Updated last year
LuoweiZhou / coco-caption
kdexd/coco-caption@de6f385
☆26 Updated 5 years ago
DavidHuji / CapDec
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
☆198 Updated last year
fawazsammani / nlxgpt
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
☆48 Updated last year
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆205 Updated 2 years ago
igorbrigadir / DownloadConceptualCaptions
Reliably download millions of images efficiently
☆116 Updated 4 years ago
McGill-NLP / imagecode
Code and data for ImageCoDe, a contextual vision-and-language benchmark
☆40 Updated last year
Cuberick-Orion / CIRR
Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
☆117 Updated 2 months ago
Cuberick-Orion / CIRPLANT
Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…
☆39 Updated last year
ilkerkesen / frozen
A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.
☆43 Updated 3 years ago
facebookresearch / diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆138 Updated 2 years ago
mlfoundations / clip_quality_not_quantity
☆29 Updated 2 years ago
postBG / CosMo.pytorch
Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.
☆66 Updated 2 years ago
Heidelberg-NLP / VALSE
Data repository for the VALSE benchmark.
☆37 Updated last year
SivanDoveh / DAC
Repository for the paper "Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models"
☆27 Updated last year
megvii-research / protoclip
📍 Official PyTorch implementation of the paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)
☆53 Updated last year
Computer-Vision-in-the-Wild / Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
☆73 Updated last year
goel-shashank / CyCLIP
☆120 Updated 2 years ago
google-research-datasets / Crisscrossed-Captions
Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
☆51 Updated 4 years ago
naver-ai / pcmepp
Official PyTorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
☆57 Updated last year
VALUE-Leaderboard / DataRelease
Data Release for VALUE Benchmark
☆31 Updated 3 years ago
e-bug / iglue
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49 Updated 2 years ago
LooperXX / ManagerTower
Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
☆11 Updated 7 months ago
redcaps-dataset / redcaps-downloader
Command-line tool for downloading and extending the RedCaps dataset.
☆48 Updated last year