Haoqing-Wang / LocalMIMLinks
[CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction
☆50Updated 2 years ago
Alternatives and similar repositories for LocalMIM
Users that are interested in LocalMIM are comparing it to the libraries listed below
Sorting:
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆76Updated 2 years ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆97Updated 2 months ago
- ☆60Updated 2 years ago
- [ICCV 2023] Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning☆45Updated last year
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆38Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆106Updated last year
- ☆35Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆60Updated 5 months ago
- ☆86Updated last year
- The official implementation for ALOFT (CVPR 2023).☆55Updated last year
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆41Updated 6 months ago
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745☆107Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆78Updated 3 months ago
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆64Updated 5 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆81Updated last month
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆34Updated 4 months ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- Official PyTorch implementation of "DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" (CVPR 2023)☆28Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆143Updated 2 years ago
- vit for few-shot classification☆47Updated 2 years ago
- ☆44Updated last year
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆44Updated 2 years ago
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago
- [CVPR 2024] Open-Set Domain Adaptation for Semantic Segmentation☆43Updated 11 months ago
- This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…☆57Updated 2 years ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆76Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆114Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆64Updated 3 months ago