ml-jku / MIM-RefinerLinks

A Contrastive Learning Boost from Intermediate Pre-Trained Representations

☆42

Alternatives and similar repositories for MIM-Refiner

Users that are interested in MIM-Refiner are comparing it to the libraries listed below

Sorting:

ml-jku / MAE-CT
☆32Updated last year
amazon-science / semi-vit
PyTorch implementation of Semi-supervised Vision Transformers
☆61Updated 2 years ago
kirill-vish / Beyond-INet
Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"
☆101Updated last year
facebookresearch / r-mae
PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411
☆113Updated 2 years ago
facebookresearch / maws
Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496
☆91Updated 7 months ago
TonyLianLong / CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆123Updated 7 months ago
Expedit-LargeScale-Vision-Transformer / Expedit-SAM
[NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…
☆85Updated 2 years ago
TomerRonen34 / mixed-resolution-vit
☆54Updated 2 years ago
OliverRensu / DeepMIM
[WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling
☆54Updated 6 months ago
alinlab / SelfPatch
☆61Updated 2 years ago
OliverRensu / D-iGPT
[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…
☆98Updated last year
mcahny / rovit
RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"
☆18Updated 2 years ago
enyac-group / supmae
This is a offical PyTorch/GPU implementation of SupMAE.
☆79Updated 3 years ago
facebookresearch / clip-rocket
Code release for "Improved baselines for vision-language pre-training"
☆61Updated last year
naver / unic
PyTorch code and pretrained weights for the UNIC models.
☆41Updated last year
ChenhongyiYang / GPViT
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
☆101Updated 2 years ago
WalBouss / GEM
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆132Updated 7 months ago
Haochen-Wang409 / HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining
☆105Updated 7 months ago
naver / trex
PyTorch implementation of the paper "No reason for no supervision: Improving the generalization of supervised models"
☆18Updated 2 years ago
vlfom / RNCDL
[NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".
☆111Updated 2 years ago
naver-ai / cl-vs-mim
(ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"
☆114Updated last year
bwconrad / flexivit
PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆63Updated last year
facebookresearch / SIE
Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations
☆30Updated 2 years ago
sail-sg / mugs
A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".
☆84Updated last year
karttikeya / minREV
A simple minimal implementation of Reversible Vision Transformers
☆126Updated last year
facebookresearch / asym-siam
PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)
☆99Updated 3 years ago
alexandre-eymael / CropMAE
[ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"
☆60Updated 8 months ago
Vibashan / Mask-free-OVIS
Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]
☆51Updated 3 weeks ago
naver-ai / augsub
[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"
☆45Updated 7 months ago
mbanani / lgssl
[CVPR 2023] Learning Visual Representations via Language-Guided Sampling
☆149Updated 2 years ago