vladhondru25 / MIM-SurveyLinks
This repository categorizes the papers about masked image modeling according to their main contributions. The classification is based on our survey: https://arxiv.org/abs/2408.06687.
☆24Updated 6 months ago
Alternatives and similar repositories for MIM-Survey
Users that are interested in MIM-Survey are comparing it to the libraries listed below
Sorting:
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆85Updated 5 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆78Updated 2 years ago
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆45Updated 11 months ago
- Adapters Strike Back (CVPR 2024)☆38Updated last year
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆68Updated 10 months ago
- [ICML2024]The official implementation of SemiRES in PyTorch.☆29Updated last year
- [ECCV 2024] Official project of CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning☆42Updated last year
- ☆78Updated 9 months ago
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆119Updated 2 years ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆62Updated last year
- Code for the paper "Do text-free diffusion models learn discriminative visual representations?"☆31Updated last year
- ☆49Updated 9 months ago
- ☆113Updated last year
- Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024☆40Updated last year
- [ECCV' 24 Oral] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection☆28Updated last year
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745☆111Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Updated 2 years ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆87Updated 6 months ago
- [CVPR 2024] Official Implementation of Collaborating Foundation models for Domain Generalized Semantic Segmentation☆74Updated 7 months ago
- This repo is a collection of AWESOME things about continual semantic segmentation, including papers, code, demos, etc. Feel free to pull …☆30Updated last year
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆60Updated last year
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆28Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆56Updated last year
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆28Updated last year
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆105Updated 7 months ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆78Updated last year
- [NeurIPS 2024] Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation☆21Updated 8 months ago
- ☆62Updated 2 years ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆86Updated last year