wgcban / adamaeLinks
[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
☆79Updated last year
Alternatives and similar repositories for adamae
Users that are interested in adamae are comparing it to the libraries listed below
Sorting:
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆76Updated 2 years ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆95Updated last month
- ☆47Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆49Updated last year
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆83Updated 2 years ago
- ☆60Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping☆97Updated 2 months ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆38Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆106Updated last year
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆69Updated last year
- ☆79Updated 2 years ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated last year
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆71Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆42Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 3 weeks ago
- A curated list of awesome self-supervised learning methods in videos☆140Updated last month
- The official github repo for "Test-Time Training with Masked Autoencoders"☆83Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- ☆109Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆40Updated last year
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆38Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆57Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆79Updated 4 months ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆32Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)☆38Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 5 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆108Updated 11 months ago