This repository categorizes the papers about masked image modeling according to their main contributions. The classification is based on our survey: https://arxiv.org/abs/2408.06687.
☆26May 3, 2025Updated 10 months ago
Alternatives and similar repositories for MIM-Survey
Users who are interested in MIM-Survey are comparing it to the repositories listed below.
- Official implementation of "CocoNet: A deep neural network for mapping pixel coordinates to color values" paper☆11Aug 30, 2018Updated 7 years ago
- [ICCV W] Contextual Convolutional Neural Networks (https://arxiv.org/pdf/2108.07387.pdf)☆14Aug 18, 2021Updated 4 years ago
- [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval☆31Jan 6, 2026Updated 2 months ago
- Tutorial about noisy labels for SIBGRAPI 2020☆11Nov 6, 2020Updated 5 years ago
- ☆35Jul 31, 2025Updated 7 months ago
- Deep learning based retinal vessel segmentation for wide-field fundus photography retinal images, IEEE Trans. Medical Imaging, 2020☆18Nov 23, 2020Updated 5 years ago
- No. 5 solution to the non-targeted attack track in the IJCAI-2019 Alibaba Adversarial AI Challenge (AAAC 2019)☆11Oct 27, 2020Updated 5 years ago
- ☆19Nov 22, 2022Updated 3 years ago
- ☆30Updated this week
- ☆10Jun 14, 2022Updated 3 years ago
- Official code for "Boosting the Adversarial Transferability of Surrogate Model with Dark Knowledge"☆12Dec 22, 2023Updated 2 years ago
- Benchmark and analysis of 165 pretrained SSL models. Code for "Evaluating Self-Supervised Learning via Risk Decomposition".☆14Jul 26, 2023Updated 2 years ago
- [CVPR 2025] ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting☆33Dec 5, 2024Updated last year
- ☆22Sep 16, 2025Updated 6 months ago
- ☆10Jun 14, 2025Updated 9 months ago
- Official repository for "On the Multi-modal Vulnerability of Diffusion Models"☆16Jul 15, 2024Updated last year
- [ECCV2024] AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking. Official Pytorch Implementation of An…☆35Sep 23, 2024Updated last year
- ☆14Sep 22, 2025Updated 6 months ago
- Multi-level Attention Network for Retinal Vessel Segmentation☆11May 10, 2021Updated 4 years ago
- ☆11Sep 27, 2023Updated 2 years ago
- Official implementation of LSSVC: A Learned Spatially Scalable Video Coding Scheme.☆11Apr 1, 2025Updated 11 months ago
- ☆12Jul 1, 2023Updated 2 years ago
- M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision (ICCV 2025)☆31Nov 19, 2025Updated 4 months ago
- ☆11Feb 2, 2026Updated last month
- ☆17Sep 25, 2019Updated 6 years ago
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 4 months ago
- [ICCV 23] PyTorch implementation of the paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Jul 14, 2023Updated 2 years ago
- RETFound - A foundation model for retinal image☆103Oct 14, 2023Updated 2 years ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆107Apr 16, 2025Updated 11 months ago
- RGB-T semantic segmentation network☆13Apr 1, 2023Updated 2 years ago
- ☆12Aug 24, 2020Updated 5 years ago
- ☆17Nov 4, 2025Updated 4 months ago
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Sep 22, 2024Updated last year
- XGEN-MM(BLIP3) Autocaptioning Tools☆17Jun 20, 2024Updated last year
- This repository contains some of the multi-view datasets that are often used in our research.☆17Jan 1, 2025Updated last year
- #ICCV, #MoE, #Tracking☆33Jul 11, 2025Updated 8 months ago
- Official Repository for Westlake Deep Learning Course (2024)☆14Jun 6, 2024Updated last year
- This is the code for the paper "On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework, IC…☆12Jul 6, 2021Updated 4 years ago
- ☆18Jun 14, 2025Updated 9 months ago