sail-sg / mugs
A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".
☆82 Updated 7 months ago
Related projects:
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604… ☆79 Updated 2 years ago
- PyTorch code for MUST ☆105 Updated last year
- This is an official PyTorch/GPU implementation of SupMAE. ☆76 Updated 2 years ago
- ECCV 2022, Bootstrapped Masked Autoencoders for Vision BERT Pretraining ☆98 Updated last year
- ReSSL: Relational Self-Supervised Learning with Weak Augmentation ☆57 Updated 2 years ago
- Code release for the research paper "Exploring Long-Sequence Masked Autoencoders" ☆99 Updated last year
- [ICLR 2024] Exploring Target Representations for Masked Autoencoders ☆52 Updated 8 months ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613) ☆100 Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022) ☆91 Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers" ☆97 Updated last year
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described … ☆64 Updated 2 years ago
- Official code for ConMIM (ICLR 2023) ☆57 Updated last year
- Un-Mix: Rethinking Image Mixtures for Unsupervised Visual Representation Learning. ☆149 Updated 2 years ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?" ☆97 Updated 6 months ago
- Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022) ☆33 Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training ☆130 Updated last year
- ☆54 Updated last year
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort… ☆114 Updated 11 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling ☆86 Updated 8 months ago
- Compress conventional Vision-Language Pre-training data ☆49 Updated 11 months ago
- ☆57 Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning ☆128 Updated last year
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆95 Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction) ☆61 Updated 2 years ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆149 Updated 9 months ago
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet ☆203 Updated last year
- ☆98 Updated 6 months ago
- 📍 Official PyTorch implementation of the paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS) ☆45 Updated 10 months ago
- [CVPR 2023] Official repository for the paper "Stare at What You See: Masked Image Modeling without Reconstruction" ☆63 Updated 9 months ago
- Code for "Finetune like you pretrain: Improved finetuning of zero-shot vision models" ☆86 Updated last year