FreddeFrallan/Multilingual-CLIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FreddeFrallan/Multilingual-CLIP)

FreddeFrallan / Multilingual-CLIP

OpenAI CLIP text encoders for multiple languages!

☆833

Alternatives and similar repositories for Multilingual-CLIP

Users that are interested in Multilingual-CLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆791Feb 9, 2023Updated 3 years ago
lucidrains / x-clip
View on GitHub
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724Oct 16, 2023Updated 2 years ago
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆14,019Jul 17, 2026Updated last week
LAION-AI / CLIP_benchmark
View on GitHub
CLIP-like model evaluation
☆815Mar 19, 2026Updated 4 months ago
rom1504 / img2dataset
View on GitHub
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
☆4,436Oct 19, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Zasder3 / train-CLIP
View on GitHub
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
☆720Apr 15, 2022Updated 4 years ago
rom1504 / clip-retrieval
View on GitHub
Easily compute clip embeddings and build a clip retrieval system with them
☆2,788Mar 28, 2026Updated 3 months ago
kakaobrain / coyo-dataset
View on GitHub
COYO-700M: Large-scale Image-Text Pair Dataset
☆1,256Nov 30, 2022Updated 3 years ago
Sense-GVT / DeCLIP
View on GitHub
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆677Sep 19, 2022Updated 3 years ago
OFA-Sys / OFA
View on GitHub
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…
☆2,557Apr 24, 2024Updated 2 years ago
google-research-datasets / wit
View on GitHub
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag…
☆1,113Sep 27, 2024Updated last year
ghchen18 / acl23_mclip
View on GitHub
The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'
☆10Jan 23, 2024Updated 2 years ago
BAAI-WuDao / BriVL
View on GitHub
Bridging Vision and Language Model
☆286Mar 27, 2023Updated 3 years ago
gregor-ge / mBLIP
View on GitHub
☆88Jan 10, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
salesforce / ALBEF
View on GitHub
Code for ALBEF: a new vision-language pre-training method
☆1,755Sep 20, 2022Updated 3 years ago
j-min / CLIP-Caption-Reward
View on GitHub
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
☆246Jun 10, 2025Updated last year
lucidrains / DALLE-pytorch
View on GitHub
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆5,628Feb 17, 2024Updated 2 years ago
salesforce / BLIP
View on GitHub
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
☆5,717Mar 3, 2026Updated 4 months ago
e-bug / iglue
View on GitHub
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49Dec 7, 2022Updated 3 years ago
zengyan-97 / CCLM
View on GitHub
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))
☆93Jun 12, 2023Updated 3 years ago
Lednik7 / CLIP-ONNX
View on GitHub
It is a simple library to speed up CLIP inference up to 3x (K80 GPU)
☆234Jul 20, 2023Updated 3 years ago
allenai / mmc4
View on GitHub
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953Mar 19, 2025Updated last year
ChenRocks / UNITER
View on GitHub
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆799Jun 30, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,063Mar 25, 2026Updated 3 months ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
salesforce / LAVIS
View on GitHub
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,255Jun 2, 2026Updated last month
rmokady / CLIP_prefix_caption
View on GitHub
Simple image captioning model
☆1,421Jun 9, 2024Updated 2 years ago
yzhuoning / Awesome-CLIP
View on GitHub
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229Jun 28, 2024Updated 2 years ago
facebookresearch / MetaCLIP
View on GitHub
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
☆1,848Nov 27, 2025Updated 7 months ago
lucidrains / CoCa-pytorch
View on GitHub
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
☆1,195Dec 12, 2023Updated 2 years ago
clip-vil / CLIP-ViL
View on GitHub
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
☆419Oct 28, 2022Updated 3 years ago
mlfoundations / datacomp
View on GitHub
DataComp: In search of the next generation of multimodal datasets
☆787Apr 28, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kakaobrain / mindall-e
View on GitHub
PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs
☆631Aug 9, 2022Updated 3 years ago
rom1504 / laion-prepro
View on GitHub
Get hundred of million of image+url from the crawling at home dataset and preprocess them
☆222May 26, 2024Updated 2 years ago
zai-org / CogView
View on GitHub
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,797Sep 25, 2023Updated 2 years ago
baaivision / EVA
View on GitHub
EVA Series: Visual Representation Fantasies from BAAI
☆2,685Aug 1, 2024Updated last year
JD-P / simulacra-aesthetic-captions
View on GitHub
Dataset of prompts, synthetic AI generated images, and aesthetic ratings.
☆427Jul 29, 2022Updated 3 years ago
microsoft / M3P
View on GitHub
Multitask Multilingual Multimodal Pre-training
☆72Nov 27, 2022Updated 3 years ago
navervision / KELIP
View on GitHub
Official PyTorch implementation of "Large-scale Bilingual Language-Image Contrastive Learning" (ICLRW 2022)
☆96Apr 13, 2022Updated 4 years ago