[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
☆353Apr 23, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-MIM
Users that are interested in Awesome-MIM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).☆861Jul 10, 2024Updated last year
- Reading list for research topics in Masked Image Modeling☆335Dec 3, 2024Updated last year
- This repository categorizes the papers about masked image modeling according to their main contributions. The classification is based on …☆26May 3, 2025Updated 10 months ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆107Apr 16, 2025Updated 11 months ago
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆74Apr 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ECCV 2022 Oral] AutoMix: Unveiling the Power of Mixup for Stronger Classifiers☆18Apr 25, 2023Updated 2 years ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆61Feb 27, 2025Updated last year
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆30Aug 15, 2024Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆17Jun 14, 2024Updated last year
- CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark☆657Oct 15, 2025Updated 5 months ago
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745☆119Jan 27, 2024Updated 2 years ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆115Mar 13, 2024Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆523Mar 14, 2023Updated 3 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,256Jul 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Sep 22, 2024Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆62Apr 30, 2024Updated last year
- This repo focuses on supervised and self-supervised bio-sequence representation learning☆22Oct 11, 2023Updated 2 years ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆84Aug 16, 2022Updated 3 years ago
- A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.☆12Dec 2, 2023Updated 2 years ago
- A curated list of prompt-based paper in computer vision and vision-language learning.☆925Dec 18, 2023Updated 2 years ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆1,059Mar 2, 2024Updated 2 years ago
- Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆37Apr 3, 2023Updated 2 years ago
- List of papers that combine self-supervision and continual learning☆80Mar 12, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆126Nov 28, 2023Updated 2 years ago
- The official implementation of paper "Unified Self-Supervised Learning Framework for Remote Sensing Images".☆101Oct 26, 2024Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆69Nov 17, 2023Updated 2 years ago
- iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)☆767Apr 14, 2022Updated 3 years ago
- [T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey☆756Aug 25, 2024Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆81Apr 28, 2023Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,030Sep 29, 2022Updated 3 years ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆86Oct 26, 2025Updated 5 months ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆93Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for …☆1,368Jan 23, 2024Updated 2 years ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆133Apr 10, 2025Updated 11 months ago
- [MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.☆130Sep 16, 2022Updated 3 years ago
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis☆577Mar 10, 2023Updated 3 years ago
- A comprehensive list of awesome contrastive self-supervised learning papers.☆1,308Sep 10, 2024Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆26Nov 27, 2024Updated last year
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆5,024Jul 30, 2024Updated last year