ChenMnZ / CF-ViT
(AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"
☆103Updated last year
Alternatives and similar repositories for CF-ViT:
Users that are interested in CF-ViT are comparing it to the libraries listed below
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆93Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆52Updated last year
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆62Updated 2 years ago
- Official code for Scale Decoupled Distillation☆37Updated 9 months ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆54Updated 2 years ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆50Updated 2 years ago
- ☆83Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"☆77Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆103Updated last year
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆52Updated last year
- ☆211Updated 3 years ago
- ☆83Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆46Updated last year
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆88Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆134Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆58Updated last year
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".☆126Updated 2 years ago
- ☆132Updated 6 months ago
- ☆58Updated last year
- ☆25Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆138Updated 2 years ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆60Updated 3 weeks ago
- ☆77Updated last year
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆31Updated 6 months ago
- ☆58Updated 2 years ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆154Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆71Updated 5 months ago
- [CVPR2022] Representation Compensation Networks for Continual Semantic Segmentation☆101Updated last year