facebookresearch/SLIP

Code release for SLIP: Self-supervision meets Language-Image Pre-training

☆792

Alternatives and similar repositories for SLIP

Users who are interested in SLIP are comparing it to the repositories listed below.

Sense-GVT / DeCLIP
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆678 · Sep 19, 2022 · Updated 3 years ago
lucidrains / x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724 · Oct 16, 2023 · Updated 2 years ago
raoyongming / DenseCLIP
[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
☆549 · Sep 15, 2023 · Updated 2 years ago
microsoft / GLIP
Grounded Language-Image Pre-training
☆2,605 · Jan 24, 2024 · Updated 2 years ago
facebookresearch / Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
☆2,007 · Mar 21, 2024 · Updated 2 years ago
bytedance / ibot
iBOT: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
☆776 · Apr 14, 2022 · Updated 4 years ago
ml-jku / cloob
☆161 · Jun 13, 2022 · Updated 4 years ago
facebookresearch / vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
☆3,295 · Mar 3, 2024 · Updated 2 years ago
microsoft / UniCL
[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"
☆410 · Nov 10, 2023 · Updated 2 years ago
mlfoundations / open_clip
An open source implementation of CLIP.
☆14,006 · Updated this week
salesforce / ALBEF
Code for ALBEF: a new vision-language pre-training method
☆1,757 · Sep 20, 2022 · Updated 3 years ago
kakaobrain / coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
☆1,256 · Nov 30, 2022 · Updated 3 years ago
kdexd / virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
☆563 · Aug 22, 2025 · Updated 10 months ago
clip-vil / CLIP-ViL
[ICLR 2022] Code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
☆419 · Oct 28, 2022 · Updated 3 years ago
facebookresearch / deit
Official DeiT repository
☆4,358 · Mar 15, 2024 · Updated 2 years ago
facebookresearch / moco-v3
PyTorch implementation of MoCo v3 https://arxiv.org/abs/2104.02057
☆1,327 · Nov 25, 2021 · Updated 4 years ago
FreddeFrallan / Multilingual-CLIP
OpenAI CLIP text encoders for multiple languages!
☆833 · May 15, 2023 · Updated 3 years ago
jayleicn / ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730 · Aug 8, 2023 · Updated 2 years ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047 · Sep 29, 2022 · Updated 3 years ago
ashkamath / mdetr
☆1,050 · Oct 3, 2022 · Updated 3 years ago
KaiyangZhou / CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
☆2,218 · May 20, 2024 · Updated 2 years ago
rom1504 / img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
☆4,436 · Oct 19, 2025 · Updated 9 months ago
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,366 · Jul 23, 2024 · Updated last year
uta-smile / TCL
Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
☆271 · Oct 2, 2024 · Updated last year
hustvl / MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
☆343 · Oct 21, 2023 · Updated 2 years ago
OFA-Sys / OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…
☆2,557 · Apr 24, 2024 · Updated 2 years ago
salesforce / BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
☆5,716 · Mar 3, 2026 · Updated 4 months ago
facebookresearch / xcit
Official code for Cross-Covariance Image Transformer (XCiT)
☆681 · Sep 28, 2021 · Updated 4 years ago
ai-forever / ru-dolph
RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP
☆254 · Feb 6, 2023 · Updated 3 years ago
openai / glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
☆3,686 · Mar 8, 2024 · Updated 2 years ago
NVlabs / GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
☆788 · May 10, 2022 · Updated 4 years ago
facebookresearch / long_seq_mae
Code release of research paper "Exploring Long-Sequence Masked Autoencoders"
☆100 · Oct 14, 2022 · Updated 3 years ago
amazon-science / bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
☆399 · Oct 23, 2024 · Updated last year
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,415 · Jan 8, 2023 · Updated 3 years ago
Zasder3 / train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
☆720 · Apr 15, 2022 · Updated 4 years ago
yzhuoning / Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229 · Jun 28, 2024 · Updated 2 years ago
m-bain / frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆377 · May 19, 2022 · Updated 4 years ago
crowsonkb / v-diffusion-pytorch
v objective diffusion inference code for PyTorch.
☆719 · Nov 29, 2022 · Updated 3 years ago
facebookresearch / msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆463 · May 9, 2022 · Updated 4 years ago