EdisonLeeeee / Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
☆788 · Updated 5 months ago
Alternatives and similar repositories for Awesome-Masked-Autoencoders:
Users who are interested in Awesome-Masked-Autoencoders are comparing it to the repositories listed below:
- PyTorch implementation of MoCo v3 https://arxiv.org/abs/2104.02057 ☆1,225 · Updated 3 years ago
- ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in co… ☆944 · Updated 3 months ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling". ☆935 · Updated 2 years ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119 ☆1,064 · Updated last year
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022 ☆554 · Updated 2 years ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022 ☆1,070 · Updated 7 months ago
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?" ☆807 · Updated 2 years ago
- A curated list of prompt-based papers in computer vision and vision-language learning. ☆904 · Updated 11 months ago
- Reading list for research topics in Masked Image Modeling ☆333 · Updated 2 weeks ago
- Explainability for Vision Transformers ☆868 · Updated 2 years ago
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions ☆1,284 · Updated 8 months ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆490 · Updated last year
- Recent Transformer-based CV and related works. ☆1,324 · Updated last year
- [ICLR 2023 Oral] Image as Set of Points ☆547 · Updated 7 months ago
- PyTorch implementation of Masked Autoencoder ☆236 · Updated last year
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897) ☆307 · Updated 2 months ago
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis ☆539 · Updated last year
- Awesome list for research on CLIP (Contrastive Language-Image Pre-Training). ☆1,150 · Updated 5 months ago
- PyTorch implementation of SimSiam https://arxiv.org/abs/2011.10566 ☆1,169 · Updated last year
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains. ☆393 · Updated 2 months ago
- CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark ☆635 · Updated last month
- Awesome Papers related to Mamba. ☆1,256 · Updated 2 months ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral) ☆1,298 · Updated 6 months ago
- Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022) ☆671 · Updated last year
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners" ☆323 · Updated 3 weeks ago
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications ☆633 · Updated last month
- A method to increase the speed and lower the memory footprint of existing vision transformers. ☆978 · Updated 6 months ago
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701 ☆880 · Updated 2 months ago
- A paper list of some recent Transformer-based CV works. ☆1,152 · Updated this week
- Low rank adaptation for Vision Transformer ☆368 · Updated 8 months ago