moein-shariatnia/OpenAI-CLIP

Simple implementation of OpenAI CLIP model in PyTorch.

☆724

Alternatives and similar repositories for OpenAI-CLIP

Users who are interested in OpenAI-CLIP are comparing it to the libraries listed below.

Zasder3 / train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
☆720 · Apr 15, 2022 · Updated 4 years ago
mlfoundations / open_clip
An open source implementation of CLIP.
☆14,007 · Updated this week
openai / CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,043 · Mar 25, 2026 · Updated 3 months ago
halixness / understanding-CLIP
Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…
☆17 · Apr 13, 2023 · Updated 3 years ago
mlfoundations / wise-ft
Robust fine-tuning of zero-shot models
☆765 · Apr 29, 2022 · Updated 4 years ago
weiyx16 / CLIP-pytorch
A non-JIT version implementation / replication of CLIP of OpenAI in pytorch
☆34 · Jan 15, 2021 · Updated 5 years ago
clip-italian / clip-italian
CLIP (Contrastive Language–Image Pre-training) for Italian
☆185 · May 11, 2023 · Updated 3 years ago
FreddeFrallan / Multilingual-CLIP
OpenAI CLIP text encoders for multiple languages!
☆833 · May 15, 2023 · Updated 3 years ago
rmokady / CLIP_prefix_caption
Simple image captioning model
☆1,421 · Jun 9, 2024 · Updated 2 years ago
rom1504 / clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
☆2,786 · Mar 28, 2026 · Updated 3 months ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,428 · Jun 22, 2026 · Updated last month
salesforce / BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
☆5,716 · Mar 3, 2026 · Updated 4 months ago
rom1504 / img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
☆4,435 · Oct 19, 2025 · Updated 9 months ago
NielsRogge / Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
☆11,675 · Apr 20, 2026 · Updated 3 months ago
facebookresearch / SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792 · Feb 9, 2023 · Updated 3 years ago
salesforce / LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,255 · Jun 2, 2026 · Updated last month
lucidrains / x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724 · Oct 16, 2023 · Updated 2 years ago
microsoft / GLIP
Grounded Language-Image Pre-training
☆2,605 · Jan 24, 2024 · Updated 2 years ago
yzhuoning / Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229 · Jun 28, 2024 · Updated 2 years ago
TheoCoombes / ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
☆101 · Mar 11, 2023 · Updated 3 years ago
lucidrains / denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
☆10,650 · Feb 11, 2026 · Updated 5 months ago
Sense-GVT / DeCLIP
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆678 · Sep 19, 2022 · Updated 3 years ago
clip-vil / CLIP-ViL
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
☆419 · Oct 28, 2022 · Updated 3 years ago
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,932 · Aug 12, 2024 · Updated last year
lucidrains / video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
☆1,384 · May 3, 2024 · Updated 2 years ago
salesforce / ALBEF
Code for ALBEF: a new vision-language pre-training method
☆1,757 · Sep 20, 2022 · Updated 3 years ago
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,000 · Updated this week
BatsResearch / fudd
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
☆11 · Nov 15, 2023 · Updated 2 years ago
diff-usion / Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
☆12,359 · Aug 1, 2024 · Updated last year
williamberrios / BrainScore-Transformers
Code from the paper "Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Scor…
☆16 · May 15, 2022 · Updated 4 years ago
lucidrains / x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
☆5,921 · Updated this week
Lotfollahi-lab / CellDISECT
Fairness in single-cell data
☆21 · Mar 25, 2026 · Updated 3 months ago
lucidrains / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆5,628 · Feb 17, 2024 · Updated 2 years ago
CompVis / taming-transformers
Taming Transformers for High-Resolution Image Synthesis
☆6,520 · Jul 30, 2024 · Updated last year
arampacha / CLIP-rsicd
☆235 · Aug 5, 2025 · Updated 11 months ago
giovanniguidi / logo-detection
One-shot logo detection on images. Implementation of the paper "A Deep One-Shot Network for Query-based LogoRetrieval" (Bhunia et al. 201…
☆22 · Jun 18, 2024 · Updated 2 years ago
CompVis / latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
☆14,111 · Feb 29, 2024 · Updated 2 years ago
facebookresearch / dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,133 · Jun 3, 2026 · Updated last month
lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆538 · Dec 8, 2023 · Updated 2 years ago