LAION-AI / scaling-laws-openclipLinks

Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)

☆179

Alternatives and similar repositories for scaling-laws-openclip

Users that are interested in scaling-laws-openclip are comparing it to the libraries listed below

Sorting:

LijieFan / LaCLIP
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
☆287Updated last year
facebookresearch / DCI
Densely Captioned Images (DCI) dataset repository.
☆191Updated last year
facebookresearch / diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆138Updated 2 years ago
LightDXY / FT-CLIP
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
☆223Updated 2 years ago
UCSC-VLAA / CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
☆319Updated last year
amazon-science / prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
☆259Updated last year
BAAI-DCAI / Visual-Instruction-Tuning
SVIT: Scaling up Visual Instruction Tuning
☆163Updated last year
X2FD / LVIS-INSTRUCT4V
☆133Updated last year
facebookresearch / genecis
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆61Updated 2 years ago
facebookresearch / CiT
Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".
☆78Updated 2 years ago
Computer-Vision-in-the-Wild / Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
☆76Updated 2 years ago
ZhangYuanhan-AI / visual_prompt_retrieval
[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
☆178Updated last year
goel-shashank / CyCLIP
☆120Updated 2 years ago
salesforce / MUST
PyTorch code for MUST
☆107Updated 6 months ago
SHI-Labs / CuMo
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
☆158Updated last year
sarahpratt / CuPL
☆194Updated 2 years ago
BAAI-DCAI / Dataset-Pruning
Dataset pruning for ImageNet and LAION-2B.
☆79Updated last year
mu-cai / matryoshka-mm
Matryoshka Multimodal Models
☆115Updated 9 months ago
Beckschen / ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
☆210Updated last year
allenai / unified-io-inference
☆228Updated last year
baaivision / CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
☆211Updated last year
YuchenLiu98 / COMM
Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
☆205Updated 10 months ago
google-research / syn-rep-learn
Learning from synthetic data - code and models
☆324Updated last year
Computer-Vision-in-the-Wild / DataDownload
☆27Updated 2 years ago
palchenli / VL-Instruction-Tuning
☆91Updated last year
altndrr / vic
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
☆107Updated last year
ZhangYuanhan-AI / NOAH
[TPAMI] Searching prompt modules for parameter-efficient transfer learning.
☆238Updated last year
sail-sg / ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆151Updated 2 years ago
hammoudhasan / SynthCLIP
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
☆100Updated 7 months ago
zejiangh / MILAN
PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…
☆83Updated 3 years ago