vladhondru25 / MIM-SurveyLinks
This repository categorizes the papers about masked image modeling according to their main contributions. The classification is based on our survey: https://arxiv.org/abs/2408.06687.
☆21Updated 5 months ago
Alternatives and similar repositories for MIM-Survey
Users that are interested in MIM-Survey are comparing it to the libraries listed below
Sorting:
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆78Updated 2 years ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆83Updated 4 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Updated last year
- Adapters Strike Back (CVPR 2024)☆38Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆55Updated 11 months ago
- Code for the paper "Do text-free diffusion models learn discriminative visual representations?"☆28Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆86Updated 4 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆41Updated 8 months ago
- ☆73Updated 7 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆102Updated last year
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆123Updated 7 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆83Updated 6 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆53Updated 2 years ago
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆28Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆62Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆41Updated last year
- ☆22Updated 7 months ago
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆67Updated 9 months ago
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆118Updated last year
- ☆46Updated 8 months ago
- ☆60Updated 2 years ago
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆44Updated 9 months ago
- This is an official implementation for PROMPT-CAM: A Simpler Interpretable Transformer for Fine-Grained Analysis (CVPR'25)☆50Updated 6 months ago
- Proteus (ICLR2025)☆54Updated 6 months ago
- [ICML2024]The official implementation of SemiRES in PyTorch.☆29Updated last year
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆27Updated last year
- [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆42Updated last year
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Updated last year
- ☆41Updated 2 weeks ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆108Updated 2 years ago