bwconrad / flexivit
PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆59Updated 10 months ago
Alternatives and similar repositories for flexivit:
Users that are interested in flexivit are comparing it to the libraries listed below
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated last year
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆90Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆63Updated 2 years ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆104Updated 3 months ago
- Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"☆68Updated last year
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆26Updated 7 months ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆112Updated last year
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆106Updated last year
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745☆95Updated last year
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆35Updated 9 months ago
- a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.☆73Updated last year
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆35Updated last year
- CVPR2024☆68Updated 2 weeks ago
- ☆62Updated last month
- ☆41Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆103Updated last year
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆100Updated last year
- ☆36Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆76Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆47Updated last year
- ☆58Updated 2 years ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆97Updated 3 weeks ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 10 months ago
- ☆58Updated 2 years ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated 10 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆71Updated 9 months ago
- LiVT PyTorch Implementation.☆67Updated 2 years ago
- Pytorch implementation of Swin MAE https://arxiv.org/abs/2212.13805☆81Updated last year
- ☆84Updated last year
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆76Updated this week