microsoft / esvitLinks
EsViT: Efficient self-supervised Vision Transformers
☆412Updated last year
Alternatives and similar repositories for esvit
Users that are interested in esvit are comparing it to the libraries listed below
Sorting:
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆456Updated 3 years ago
- Self-supervised vIsion Transformer (SiT)☆331Updated 2 years ago
- Implementation of Pixel-level Contrastive Learning, proposed in the paper "Propagate Yourself", in Pytorch☆259Updated 4 years ago
- Code for the Convolutional Vision Transformer (ConViT)☆465Updated 3 years ago
- ☆246Updated 3 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆488Updated 2 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆556Updated 3 years ago
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆656Updated 4 years ago
- Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.☆559Updated last year
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"☆815Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆427Updated last year
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆231Updated 3 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆578Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆505Updated 2 years ago
- [CVPR 2022] Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization