awilliamson10 / clipora
Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).
☆24 · Updated last year
Alternatives and similar repositories for clipora
Users who are interested in clipora are comparing it to the libraries listed below.
- An Enhanced CLIP Framework for Learning with Synthetic Captions ☆37 · Updated 6 months ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity" ☆61 · Updated 2 years ago
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification ☆107 · Updated last year
- ☆43 · Updated last month
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption ☆101 · Updated 2 years ago
- ☆53 · Updated 9 months ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆179 · Updated 4 months ago
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)… ☆77 · Updated 5 months ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites" ☆287 · Updated last year
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) ☆25 · Updated last year
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets ☆58 · Updated 2 years ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023) ☆73 · Updated 2 years ago
- Generating Image Specific Text ☆29 · Updated 2 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" ☆21 · Updated 11 months ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?" ☆35 · Updated 3 years ago
- ☆53 · Updated 3 years ago
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions ☆55 · Updated last year
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training ☆138 · Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆100 · Updated last year
- ☆194 · Updated 2 years ago
- Paper list of compositional zero-shot learning ☆10 · Updated 3 years ago
- ☆30 · Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆24 · Updated 11 months ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding ☆52 · Updated 7 months ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti… ☆19 · Updated 2 years ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆81 · Updated 2 years ago
- Code for paper: Simple and Robust Loss Design for Multi-Label Learning with Missing Labels ☆51 · Updated 2 years ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset ☆78 · Updated 3 months ago
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning ☆36 · Updated last year
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS) ☆53 · Updated 2 years ago