Jiahao000 / MFMLinks
[ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training
☆75Updated 2 years ago
Alternatives and similar repositories for MFM
Users that are interested in MFM are comparing it to the libraries listed below
Sorting:
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆95Updated last month
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆49Updated last year
- ☆34Updated last year
- ☆60Updated 2 years ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆38Updated last year
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆63Updated 4 months ago
- Adapters Strike Back (CVPR 2024)☆36Updated 10 months ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated 2 years ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆77Updated this week
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆77Updated last month
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆55Updated last year
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆43Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆106Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆142Updated last year
- ☆22Updated 2 years ago
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆27Updated 9 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆79Updated 10 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆82Updated 2 months ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆77Updated 5 months ago
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆37Updated 11 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated 7 months ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 3 weeks ago
- [ICCV 2023] Bayesian Prompt Learning for Image-Language Model Generalization☆33Updated last year
- [ICCV 2023] Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning☆45Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆59Updated 4 months ago
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆68Updated last year
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆43Updated last month