Zasder3 / train-CLIP-FT
☆46Updated 3 years ago
Alternatives and similar repositories for train-CLIP-FT:
Users that are interested in train-CLIP-FT are comparing it to the libraries listed below
- PyTorch code for MUST☆106Updated last year
- ☆50Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆133Updated last year
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆37Updated 7 months ago
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆60Updated 3 years ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"☆54Updated 2 years ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆110Updated 2 years ago
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆56Updated last year
- ☆34Updated last year
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 2 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…☆128Updated 2 years ago
- This repository contains the code for our CVPR 2022 paper on "Integrating Language Guidance into Vision-based Deep Metric Learning".☆43Updated 2 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆27Updated 2 years ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets☆55Updated last year
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆32Updated 2 years ago
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks☆52Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆52Updated last year
- ☆47Updated 3 years ago
- ☆59Updated 2 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆108Updated 2 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning☆60Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆46Updated last year
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)☆127Updated 11 months ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆94Updated last year
- ☆117Updated last year
- Official Code of ECCV 2022 paper MS-CLIP☆88Updated 2 years ago