EdisonLeeeee/Awesome-Masked-Autoencoders



A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).

☆868

Alternatives and similar repositories for Awesome-Masked-Autoencoders

Users who are interested in Awesome-Masked-Autoencoders are comparing it to the libraries listed below.


ucasligang / awesome-MIM
Reading list for research topics in Masked Image Modeling
☆333 · Dec 3, 2024 · Updated last year
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,371 · Jul 23, 2024 · Updated 2 years ago
Lupin1998 / Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
☆354 · Apr 23, 2025 · Updated last year
quanlin-wu / dmae
Denoising Masked Autoencoders Help Robust Classification.
☆67 · Jun 4, 2023 · Updated 3 years ago
facebookresearch / mae_st
Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"
☆371 · Jan 12, 2026 · Updated 6 months ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047 · Sep 29, 2022 · Updated 3 years ago
AbrahamYabo / SdAE
☆64 · Feb 6, 2023 · Updated 3 years ago
Alpha-VL / ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
☆531 · Mar 14, 2023 · Updated 3 years ago
pengzhiliang / MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
☆2,691 · Jul 25, 2023 · Updated 3 years ago
sangminwoo / awesome-vision-and-language
A curated list of awesome vision and language resources (still under construction... stay tuned!)
☆562 · Nov 4, 2024 · Updated last year
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775 · Dec 8, 2023 · Updated 2 years ago
zejiangh / MILAN
PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…
☆84 · Aug 16, 2022 · Updated 3 years ago
UCSC-VLAA / DMAE
[CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
☆109 · Jul 24, 2023 · Updated 3 years ago
LightDXY / BootMAE
ECCV 2022, Bootstrapped Masked Autoencoders for Vision BERT Pretraining
☆97 · Nov 2, 2022 · Updated 3 years ago
TonyLianLong / CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆135 · Apr 10, 2025 · Updated last year
dk-liang / Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,589 · Jan 7, 2025 · Updated last year
diff-usion / Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
☆12,362 · Aug 1, 2024 · Updated last year
lxtGH / CAE
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
☆199 · Jan 11, 2023 · Updated 3 years ago
lxtGH / Awesome-Segmentation-With-Transformer
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
☆758 · Aug 25, 2024 · Updated last year
zhenyingfang / Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
☆593 · Updated this week
open-mmlab / mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
☆3,302 · Jun 25, 2023 · Updated 3 years ago
liuxingbin / dbot
[ICLR 2024] Exploring Target Representations for Masked Autoencoders
☆56 · Jan 17, 2024 · Updated 2 years ago
bytedance / ibot
iBOT: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
☆777 · Apr 14, 2022 · Updated 4 years ago
ucasligang / SemMAE
[NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders
☆43 · Jun 18, 2023 · Updated 3 years ago
yuewang-cuhk / awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
☆1,159 · Aug 19, 2022 · Updated 3 years ago
EPFL-VILAB / MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
☆632 · Dec 13, 2022 · Updated 3 years ago
facebookresearch / ConvNeXt-V2
Code release for ConvNeXt V2 model
☆2,066 · Aug 14, 2024 · Updated last year
gkakogeorgiou / attmask
[ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling
☆73 · Apr 18, 2024 · Updated 2 years ago
yzhuoning / Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229 · Jun 28, 2024 · Updated 2 years ago
jason718 / awesome-self-supervised-learning
A curated list of awesome self-supervised methods
☆6,405 · Feb 24, 2026 · Updated 5 months ago
OliverRensu / DeepMIM
[WACV 2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling
☆56 · May 10, 2025 · Updated last year
facebookresearch / msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆463 · May 9, 2022 · Updated 4 years ago
baaivision / EVA
EVA Series: Visual Representation Fantasies from BAAI
☆2,685 · Aug 1, 2024 · Updated last year
cmhungsteve / Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
☆5,050 · Jul 30, 2024 · Updated last year
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,439 · Jun 22, 2026 · Updated last month
ttengwang / Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
☆927 · Dec 18, 2023 · Updated 2 years ago
enyac-group / supmae
This is an official PyTorch/GPU implementation of SupMAE.
☆80 · Aug 30, 2022 · Updated 3 years ago
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,013 · Jul 16, 2026 · Updated last week
czczup / ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,503 · Jun 3, 2025 · Updated last year