kantharajucn / CLIP-imagenet-evaluation
Run CLIP inference on the ImageNet dataset, use the resulting predictions as labels to train other models, then evaluate the trained models on the ImageNet validation set using either the original labels or the CLIP labels.
☆12 · Updated 4 years ago
Alternatives and similar repositories for CLIP-imagenet-evaluation
Users who are interested in CLIP-imagenet-evaluation are comparing it to the libraries listed below.
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021) ☆133 · Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022) ☆207 · Updated 2 years ago
- ☆175 · Updated last year
- Exploring Visual Prompts for Adapting Large-Scale Models ☆285 · Updated 3 years ago
- ImageNet-R(endition) and DeepAugment (ICCV 2021) ☆275 · Updated 4 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022 ☆266 · Updated last year
- Recent Advances in Vision and Language Pre-training (VLP) ☆294 · Updated 2 years ago
- Code for Finetune like you pretrain: Improved finetuning of zero-shot vision models ☆103 · Updated 2 years ago
- Reliably download millions of images efficiently ☆117 · Updated 4 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383 ☆416 · Updated 3 years ago
- PyTorch implementation of SimCLR: supports multi-GPU training and closely reproduces results ☆210 · Updated last year
- Flickr30K Entities Dataset ☆180 · Updated 6 years ago
- Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers ☆97 · Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space" ☆402 · Updated last year
- Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022 ☆145 · Updated 3 years ago
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework". ☆84 · Updated last year
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o… ☆40 · Updated last year
- "Automatically Discovering and Learning New Visual Categories with Ranking Statistics" by Kai Han, Sylvestre-Alvise Rebuffi, Sebastien Eh… ☆229 · Updated 5 years ago
- METER: A Multimodal End-to-end TransformER Framework ☆373 · Updated 2 years ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆163 · Updated 3 years ago
- ☆59 · Updated 3 years ago
- EsViT: Efficient self-supervised Vision Transformers ☆411 · Updated 2 years ago
- [CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning ☆208 · Updated 3 years ago
- Official Implementation of SWAD (NeurIPS 2021) ☆169 · Updated 2 years ago
- Pretrained SimCLRv2 models in Pytorch ☆105 · Updated 5 years ago
- ☆194 · Updated 2 years ago
- Toolkit for Elevater Benchmark ☆75 · Updated 2 years ago
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning" ☆240 · Updated 2 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm ☆667 · Updated 3 years ago
- Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models ☆123 · Updated 3 weeks ago