ucasligang/awesome-MIM

Reading list for research topics in Masked Image Modeling

☆333

Alternatives and similar repositories for awesome-MIM

Users who are interested in awesome-MIM are comparing it to the repositories listed below.

EdisonLeeeee / Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
☆868 · Jul 10, 2024 · Updated 2 years ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047 · Sep 29, 2022 · Updated 3 years ago
lxtGH / CAE
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
☆199 · Jan 11, 2023 · Updated 3 years ago
Alpha-VL / ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
☆531 · Mar 14, 2023 · Updated 3 years ago
bytedance / ibot
iBOT: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
☆777 · Apr 14, 2022 · Updated 4 years ago
hustvl / MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
☆343 · Oct 21, 2023 · Updated 2 years ago
zejiangh / MILAN
PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…
☆84 · Aug 16, 2022 · Updated 3 years ago
Lupin1998 / Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
☆354 · Apr 23, 2025 · Updated last year
ucasligang / SemMAE
[NeurIPS 2022] Code for the paper "SemMAE: Semantic-guided masking for learning masked autoencoders"
☆43 · Jun 18, 2023 · Updated 3 years ago
LightDXY / BootMAE
[ECCV 2022] Bootstrapped Masked Autoencoders for Vision BERT Pretraining
☆97 · Nov 2, 2022 · Updated 3 years ago
ucasligang / awesome-Diffusion
Reading list for research topics in Diffusion models.
☆18 · Jan 12, 2024 · Updated 2 years ago
ucasligang / SimViT
[ICME 2022] Code for the paper "SimViT: Exploring a simple vision transformer with sliding windows"
☆67 · Oct 11, 2022 · Updated 3 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆19 · Jan 3, 2023 · Updated 3 years ago
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,375 · Jul 23, 2024 · Updated 2 years ago
enyac-group / supmae
This is an official PyTorch/GPU implementation of SupMAE.
☆80 · Aug 30, 2022 · Updated 3 years ago
pengzhiliang / MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
☆2,691 · Jul 25, 2023 · Updated 3 years ago
EPFL-VILAB / MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
☆634 · Dec 13, 2022 · Updated 3 years ago
ttengwang / Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
☆927 · Dec 18, 2023 · Updated 2 years ago
facebookresearch / long_seq_mae
Code release for the research paper "Exploring Long-Sequence Masked Autoencoders"
☆100 · Oct 14, 2022 · Updated 3 years ago
yzhuoning / Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229 · Jun 28, 2024 · Updated 2 years ago
Westlake-AI / openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
☆658 · Oct 15, 2025 · Updated 9 months ago
ShoufaChen / AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
☆388 · Sep 16, 2022 · Updated 3 years ago
amazon-science / bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
☆399 · Oct 23, 2024 · Updated last year
ViTAE-Transformer / ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
☆586 · Apr 24, 2022 · Updated 4 years ago
sunsmarterjie / beyond_masking
Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers
☆26 · Apr 12, 2022 · Updated 4 years ago
facebookresearch / asym-siam
PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)
☆98 · May 2, 2022 · Updated 4 years ago
WXinlong / DenseCL
Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.
☆570 · Dec 26, 2023 · Updated 2 years ago
facebookresearch / SLIP
Code release for SLIP: Self-supervision meets Language-Image Pre-training
☆791 · Feb 9, 2023 · Updated 3 years ago
facebookresearch / data2vec_vision
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆81 · Jan 7, 2026 · Updated 6 months ago
czczup / ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,503 · Jun 3, 2025 · Updated last year
facebookresearch / r-mae
PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411
☆112 · Jun 9, 2023 · Updated 3 years ago
baaivision / EVA
EVA Series: Visual Representation Fantasies from BAAI
☆2,685 · Aug 1, 2024 · Updated last year
gaopengcuhk / Stable-Pix2Seq
A full-fledged version of Pix2Seq
☆237 · Nov 6, 2021 · Updated 4 years ago
open-mmlab / mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
☆3,302 · Jun 25, 2023 · Updated 3 years ago
wisdomikezogwo / MMAE_Pathology
☆12 · Oct 4, 2023 · Updated 2 years ago
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,777 · Dec 8, 2023 · Updated 2 years ago
CoinCheung / MFM
Code for the paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)
☆24 · Feb 3, 2023 · Updated 3 years ago
Tete-Xiao / ReSim
PyTorch Implementation of Region Similarity Representation Learning (ReSim)
☆90 · Jul 27, 2021 · Updated 5 years ago
DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,344 · Aug 22, 2023 · Updated 2 years ago