kantharajucn / CLIP-imagenet-evaluation
Run CLIP inference on the ImageNet dataset, use the resulting predictions as labels to train other models, and then evaluate the trained models on the ImageNet validation set using either the original labels or the CLIP labels.
☆12Updated 4 years ago
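The workflow described above can be illustrated with a minimal sketch. It is not the repository's code: it assumes the usual CLIP zero-shot setup, in which each image is assigned the class whose text-prompt embedding has the highest cosine similarity to the image embedding. The function names (`zero_shot_labels`, `accuracy`) and the tiny 2-D embeddings are hypothetical stand-ins for real CLIP outputs.

```python
import math


def normalize(vector):
    """Scale a vector to unit length (assumes it is not all zeros)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def zero_shot_labels(image_embeddings, text_embeddings):
    """Label each image with the index of the most similar class prompt.

    Similarity is the cosine similarity between unit-normalized embeddings.
    """
    text_unit = [normalize(t) for t in text_embeddings]
    labels = []
    for image in image_embeddings:
        image_unit = normalize(image)
        sims = [sum(a * b for a, b in zip(image_unit, t)) for t in text_unit]
        labels.append(max(range(len(sims)), key=sims.__getitem__))
    return labels


def accuracy(predictions, targets):
    """Fraction of predictions that match the targets."""
    correct = sum(p == t for p, t in zip(predictions, targets))
    return correct / len(targets)


# Toy data (hypothetical): three class prompts and four images in 2-D.
text_embeddings = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
image_embeddings = [[0.9, 0.1], [0.2, 0.8], [-0.7, 0.3], [0.6, 0.5]]
original_labels = [0, 1, 2, 1]

# Step 1: derive CLIP-style labels for every image.
clip_labels = zero_shot_labels(image_embeddings, text_embeddings)

# Step 2: the labels could now supervise another model; afterwards that
# model's predictions can be scored against either label set.
agreement = accuracy(clip_labels, original_labels)
print(clip_labels, agreement)
```

In a real run the embeddings would come from a CLIP image encoder and text encoder, and the agreement between the two label sets indicates how much label noise a model trained on CLIP labels inherits.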
Alternatives and similar repositories for CLIP-imagenet-evaluation
Users who are interested in CLIP-imagenet-evaluation are comparing it to the repositories listed below
- Code for Finetune like you pretrain: Improved finetuning of zero-shot vision models☆100Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆205Updated 2 years ago
- ☆168Updated last year
- ☆243Updated 3 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆265Updated 9 months ago
- Exploring Visual Prompts for Adapting Large-Scale Models☆281Updated 3 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383☆414Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆402Updated last year
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆157Updated 2 years ago
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)☆132Updated last year
- PyTorch implementation of SimCLR: supports multi-GPU training and closely reproduces results☆206Updated last year
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆660Updated 2 years ago
- ImageNet-R(endition) and DeepAugment (ICCV 2021)☆268Updated 4 years ago
- [ICCV 2021 - Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆859Updated last year
- Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers☆94Updated last year
- Reliably download millions of images efficiently☆116Updated 4 years ago
- ImageNet-Sketch data set for evaluating model's ability in learning (out-of-domain) semantics at ImageNet scale☆219Updated 3 years ago
- METER: A Multimodal End-to-end TransformER Framework☆373Updated 2 years ago
- ☆191Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆774Updated 2 years ago
- Official Implementation of SWAD (NeurIPS 2021)☆165Updated 2 years ago
- Toolkit for Elevater Benchmark☆73Updated last year
- EsViT: Efficient self-supervised Vision Transformers☆413Updated last year
- The repository for the official Biased Action Recognition (BAR) dataset for the paper Learning from Failure: Training Debiased Classifier…☆33Updated 4 years ago
- Flickr30K Entities Dataset☆177Updated 6 years ago
- An implementation of "A Simple Framework for Contrastive Learning of Visual Representations" SimCLR☆33Updated 4 years ago
- Recent Advances in Vision and Language Pre-training (VLP)☆292Updated 2 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆371Updated last year
- project page for VinVL☆356Updated last year
- Pretrained SimCLRv2 models in Pytorch☆105Updated 4 years ago