FlyEgle / MAE-pytorch
Masked Autoencoders Are Scalable Vision Learners
☆247Updated last year
Related projects ⓘ
Alternatives and complementary repositories for MAE-pytorch
- Official MegEngine implementation of RepLKNet☆268Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆484Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆193Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆447Updated last year
- iFormer: Inception Transformer☆243Updated last year
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆580Updated last year
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆284Updated 2 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.☆171Updated last year
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆627Updated 3 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆425Updated last year
- Implementation of Convolutional enhanced image Transformer☆101Updated 3 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆183Updated 2 years ago
- Self-supervised vIsion Transformer (SiT)☆325Updated last year
- Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.☆549Updated 10 months ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆363Updated 10 months ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆929Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆547Updated last year
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆149Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆281Updated 2 years ago
- Pytorch version of Vision Transformer (ViT) with pretrained models. This is part of CASL (https://casl-project.github.io/) and ASYML proj…☆342Updated 4 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆322Updated 11 months ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆573Updated last year
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆224Updated 2 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers☆224Updated 3 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆354Updated 2 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆248Updated last year
- RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)☆303Updated last year
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)☆190Updated 2 years ago
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427☆370Updated 2 years ago
- Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition☆547Updated 3 years ago