ml-jku / MIM-Refiner
A Contrastive Learning Boost from Intermediate Pre-Trained Representations
☆42Updated 6 months ago
Alternatives and similar repositories for MIM-Refiner:
Users who are interested in MIM-Refiner are comparing it to the libraries listed below
- ☆32Updated 11 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 10 months ago
- ☆52Updated 2 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- PyTorch code and pretrained weights for the UNIC models.☆28Updated 6 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆48Updated 3 weeks ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 6 months ago
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆28Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆103Updated 3 months ago
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411☆112Updated last year
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆113Updated 5 months ago
- ☆50Updated last year
- Official code for ConMIM (ICLR 2023)☆58Updated 2 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated last year
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆53Updated last year
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".☆84Updated last year
- ☆19Updated 3 weeks ago
- ☆42Updated last week
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆76Updated last year
- ☆36Updated 2 months ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆59Updated 10 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated 10 months ago
- ☆23Updated 5 months ago
- ☆58Updated last year
- ☆43Updated 2 months ago
- Code release for "Improved baselines for vision-language pre-training"☆60Updated 10 months ago
- This repo contains the code and configuration files for reproducing object detection results of FocalNets with DINO☆67Updated 2 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated last year
- Official repository of paper "Subobject-level Image Tokenization"☆65Updated 11 months ago