Haochen-Wang409 / DropPosLinks
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
☆61Updated last year
Alternatives and similar repositories for DropPos
Users that are interested in DropPos are comparing it to the libraries listed below
Sorting:
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆90Updated last year
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆102Updated 4 months ago
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆72Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆107Updated 2 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆77Updated 2 years ago
- ☆62Updated 2 years ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆84Updated 3 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆192Updated 2 years ago
- ☆113Updated last year
- Official implementation of the paper "Masked Autoencoders are Efficient Class Incremental Learners"☆43Updated last year
- The official github repo for "Test-Time Training with Masked Autoencoders"☆88Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆117Updated last year
- ☆91Updated 2 years ago
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆71Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆42Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆52Updated 2 years ago
- LiVT PyTorch Implementation.☆72Updated 2 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆144Updated 2 years ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆88Updated last year
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆43Updated 8 months ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆56Updated last year
- ☆35Updated last year
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆43Updated 2 years ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆114Updated 2 years ago
- Code for "Training on Thin Air: Improve Image Classification with Generated Data"☆48Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆80Updated 5 months ago
- [NeurIPS 2022] Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering☆48Updated last year
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆39Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆117Updated 5 months ago
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆104Updated 2 years ago