tulip-berkeley/open_clip

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tulip-berkeley/open_clip)

tulip-berkeley / open_clip

An open source implementation of CLIP (With TULIP Support)

☆165

Alternatives and similar repositories for open_clip

Users that are interested in open_clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

haoyu-bu / CAFe
View on GitHub
Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
☆33Mar 26, 2025Updated last year
NVlabs / QLIP
View on GitHub
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
☆97Mar 1, 2025Updated last year
mingukkang / FlashDecoder
View on GitHub
Official FlashDecoder Github
☆15Apr 4, 2026Updated 3 months ago
Jingfeng0705 / LIFT
View on GitHub
The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders
☆43Jun 10, 2025Updated last year
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated last year
XMUDeepLIT / LLaVE
View on GitHub
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆78May 23, 2025Updated last year
bronyayang / Law_of_Vision_Representation_in_MLLMs
View on GitHub
[COLM'25] Official implementation of the Law of Vision Representation in MLLMs
☆176Oct 6, 2025Updated 9 months ago
likyoo / awesome-semi-supervised-segmentation
View on GitHub
An up-to-date & curated list of awesome semi-supervised segmentation papers, methods & resources.
☆13Dec 22, 2023Updated 2 years ago
sung-yeon-kim / R-Adapter-ECCV2024
View on GitHub
Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024
☆17Oct 3, 2024Updated last year
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
CompVis / maskflow
View on GitHub
MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation
☆28Mar 4, 2025Updated last year
dfan / webssl
View on GitHub
Code for Scaling Language-Free Visual Representation Learning (WebSSL)
☆244Apr 24, 2025Updated last year
junha-l / dexter
View on GitHub
☆20Jun 5, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gongda0e / KARI
View on GitHub
Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)
☆14Jun 14, 2024Updated 2 years ago
facebookresearch / perception_models
View on GitHub
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,324Apr 13, 2026Updated 3 months ago
TonyLianLong / igligen
View on GitHub
Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation
☆46Jun 1, 2024Updated 2 years ago
chrockey / RIST
View on GitHub
[CVPR 2024] Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform
☆14Apr 9, 2026Updated 3 months ago
wdrink / SimpleAR
View on GitHub
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆431Jun 20, 2025Updated last year
tianyu-z / VCR
View on GitHub
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
☆32Feb 26, 2025Updated last year
SAITPublic / BiRF
View on GitHub
[NeurIPS 2023] Official PyTorch implementation of Binary Radiance Fields
☆23Jan 9, 2024Updated 2 years ago
343gltysprk / ovow
View on GitHub
☆39Nov 25, 2025Updated 7 months ago
chrockey / Affostruction
View on GitHub
[CVPR 2026] Affostruction: 3D Affordance Grounding with Generative Reconstruction
☆17May 14, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zhangzef / COOPER
View on GitHub
The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.
☆38Jul 1, 2026Updated 2 weeks ago
vision-x-nyu / test-set-training
View on GitHub
☆15Nov 25, 2025Updated 7 months ago
wookiekim / HCCNet
View on GitHub
Official PyTorch implementation of HCCNet: Efficient Semantic Matching with Hypercolumn Correlation (WACV '24 Oral, Best paper finalist (…
☆11Apr 29, 2024Updated 2 years ago
Mozhgan91 / LEO
View on GitHub
LEO: A powerful Hybrid Multimodal LLM
☆20Jan 18, 2025Updated last year
wenwenzju / TGRMPT
View on GitHub
Multi-Person Tracking in Tour Guide Robot
☆10Aug 23, 2022Updated 3 years ago
apple / ml-aim
View on GitHub
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
☆1,425Aug 4, 2025Updated 11 months ago
DAMO-NLP-SG / Inf-CLIP
View on GitHub
[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for C…
☆287Jan 16, 2025Updated last year
NVlabs / RADIO
View on GitHub
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,897May 29, 2026Updated last month
apple / ml-flextok
View on GitHub
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆321Jun 2, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
kongds / E5-V
View on GitHub
E5-V: Universal Embeddings with Multimodal Large Language Models
☆275Dec 10, 2025Updated 7 months ago
DAMO-NLP-SG / DiGIT
View on GitHub
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆78Oct 31, 2024Updated last year
RAIVNLab / sugar-crepe
View on GitHub
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆93Feb 13, 2024Updated 2 years ago
MME-Benchmarks / MME-Unify
View on GitHub
✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
☆42Apr 10, 2025Updated last year
haon-chen / MoCa
View on GitHub
☆68Aug 14, 2025Updated 11 months ago
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
LAION-AI / scaling-laws-for-comparison
View on GitHub
☆22May 12, 2026Updated 2 months ago