wgcban / adamae
[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
☆74 Updated last year
Alternatives and similar repositories for adamae:
Users who are interested in adamae are comparing it to the repositories listed below
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆73 Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆61 Updated 10 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling ☆90 Updated last year
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders ☆35 Updated last year
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653) ☆28 Updated 9 months ago
- ☆58 Updated 2 years ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆69 Updated 7 months ago
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization ☆56 Updated last year
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411 ☆110 Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction ☆46 Updated last year
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction" ☆69 Updated last year
- ☆47 Updated 2 years ago
- ☆105 Updated 11 months ago
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap… ☆60 Updated last month
- ☆75 Updated last year
- Official codes for ConMIM (ICLR 2023) ☆58 Updated 2 years ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆74 Updated 6 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23) ☆37 Updated last year
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆49 Updated last month
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation ☆37 Updated last year
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆96 Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22 ☆41 Updated 2 years ago
- ☆54 Updated 2 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆25 Updated last month
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆69 Updated 8 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module) ☆111 Updated 4 months ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders ☆52 Updated last year
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation) ☆98 Updated 10 months ago
- ☆52 Updated last year
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) ☆25 Updated 9 months ago