Atten4Vis / CAE
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning".
☆97 · Updated last year
Alternatives and similar repositories for CAE:
Users who are interested in CAE are comparing it to the repositories listed below:
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆59 · Updated 8 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling ☆88 · Updated last year
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745 ☆88 · Updated 11 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆68 · Updated last year
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers" ☆103 · Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning ☆134 · Updated last year
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?" ☆105 · Updated 10 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction ☆46 · Updated last year
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model ☆89 · Updated 7 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆76 · Updated 4 months ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio… ☆215 · Updated 4 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆68 · Updated 7 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆99 · Updated last month
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View". ☆31 · Updated 6 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation ☆65 · Updated 5 months ago
- Official PyTorch implementation of DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024) ☆101 · Updated 6 months ago
- ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No ☆131 · Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆71 · Updated 5 months ago
- ImageNet-1K data download, processing for using as a dataset ☆77 · Updated last year
- Open source implementation of "Vision Transformers Need Registers" ☆162 · Updated 2 months ago
- [ECCV 2022] Official Implementation for Unsupervised Selective Labeling for More Effective Semi-Supervised Learning ☆60 · Updated last year
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" ☆192 · Updated 9 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models ☆167 · Updated last year