LAION-AI/CLIP_benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LAION-AI/CLIP_benchmark)

LAION-AI / CLIP_benchmark

CLIP-like model evaluation

☆814

Alternatives and similar repositories for CLIP_benchmark

Users that are interested in CLIP_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆14,006Updated this week
rom1504 / img2dataset
View on GitHub
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
☆4,436Oct 19, 2025Updated 9 months ago
mlfoundations / datacomp
View on GitHub
DataComp: In search of the next generation of multimodal datasets
☆787Apr 28, 2025Updated last year
rom1504 / clip-retrieval
View on GitHub
Easily compute clip embeddings and build a clip retrieval system with them
☆2,786Mar 28, 2026Updated 3 months ago
UCSC-VLAA / CLIPA
View on GitHub
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
☆321Jun 3, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FreddeFrallan / Multilingual-CLIP
View on GitHub
OpenAI CLIP text encoders for multiple languages!
☆833May 15, 2023Updated 3 years ago
facebookresearch / MetaCLIP
View on GitHub
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
☆1,846Nov 27, 2025Updated 7 months ago
LAION-AI / scaling-laws-openclip
View on GitHub
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
☆200Jun 21, 2025Updated last year
mlfoundations / wise-ft
View on GitHub
Robust fine-tuning of zero-shot models
☆765Apr 29, 2022Updated 4 years ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792Feb 9, 2023Updated 3 years ago
mertyg / vision-language-models-are-bows
View on GitHub
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …
☆294Jun 7, 2023Updated 3 years ago
mlfoundations / clip_quality_not_quantity
View on GitHub
☆28Oct 18, 2022Updated 3 years ago
salesforce / LAVIS
View on GitHub
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,253Jun 2, 2026Updated last month
mlfoundations / open_flamingo
View on GitHub
An open-source framework for training large multimodal models.
☆4,114Aug 31, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
baaivision / EVA
View on GitHub
EVA Series: Visual Representation Fantasies from BAAI
☆2,686Aug 1, 2024Updated last year
lucidrains / x-clip
View on GitHub
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724Oct 16, 2023Updated 2 years ago
facebookresearch / diht
View on GitHub
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆141Dec 16, 2025Updated 7 months ago
mlfoundations / open-diffusion
View on GitHub
Simple large-scale training of stable diffusion with multi-node support.
☆132May 8, 2023Updated 3 years ago
Sense-GVT / DeCLIP
View on GitHub
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆678Sep 19, 2022Updated 3 years ago
kakaobrain / coyo-dataset
View on GitHub
COYO-700M: Large-scale Image-Text Pair Dataset
☆1,256Nov 30, 2022Updated 3 years ago
microsoft / GLIP
View on GitHub
Grounded Language-Image Pre-training
☆2,605Jan 24, 2024Updated 2 years ago
salesforce / BLIP
View on GitHub
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
☆5,716Mar 3, 2026Updated 4 months ago
google-research / big_vision
View on GitHub
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,494May 19, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
beichenzbc / Long-CLIP
View on GitHub
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
☆900Aug 13, 2024Updated last year
RAIVNLab / sugar-crepe
View on GitHub
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆93Feb 13, 2024Updated 2 years ago
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated last year
LijieFan / LaCLIP
View on GitHub
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
☆291Jan 14, 2024Updated 2 years ago
LAION-AI / General-GPT
View on GitHub
☆65Oct 4, 2023Updated 2 years ago
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,038Mar 25, 2026Updated 3 months ago
microsoft / X-Decoder
View on GitHub
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,346Oct 5, 2023Updated 2 years ago
yzhuoning / Awesome-CLIP
View on GitHub
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229Jun 28, 2024Updated 2 years ago
allenai / mmc4
View on GitHub
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953Mar 19, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
rom1504 / laion-prepro
View on GitHub
Get hundred of million of image+url from the crawling at home dataset and preprocess them
☆222May 26, 2024Updated 2 years ago
KaiyangZhou / CoOp
View on GitHub
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
☆2,218May 20, 2024Updated 2 years ago
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
mlfoundations / patching
View on GitHub
Patching open-vocabulary models by interpolating weights
☆91Sep 28, 2023Updated 2 years ago
facebookresearch / CiT
View on GitHub
Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".
☆78Jan 18, 2023Updated 3 years ago
Zasder3 / train-CLIP
View on GitHub
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
☆720Apr 15, 2022Updated 4 years ago
tsb0601 / MMVP
View on GitHub
☆364Jan 27, 2024Updated 2 years ago