Annbless / ViTAE
The official PyTorch implementation of ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias
☆103 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for ViTAE
- Official implementation of the paper "Unifying Nonlocal Blocks for Neural Networks" (ICCV'21) ☆96 · Updated 2 years ago
- The official repo of the CVPR 2021 oral paper: Representative Batch Normalization with Feature Calibration ☆86 · Updated 2 years ago
- ☆49 · Updated 2 years ago
- ☆108 · Updated 3 years ago
- Intra-class Feature Variation Distillation for Semantic Segmentation (ECCV 2020) ☆71 · Updated 4 years ago
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022) ☆81 · Updated 2 years ago
- ☆98 · Updated 2 years ago
- This repo contains the code of "ConTNet: Why not use convolution and transformer at the same time?" ☆95 · Updated 3 years ago
- MobileFormer in torch ☆66 · Updated 3 years ago
- ☆69 · Updated last month
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM 2021) ☆149 · Updated 3 years ago
- ☆56 · Updated 2 years ago
- Official code for the paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" (ICLR 2022 Spotlight) ☆183 · Updated 2 years ago
- [CVPR 2022 - Oral] Official JAX implementation of Learned Queries for Efficient Local Attention ☆114 · Updated 2 years ago
- An unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer ☆53 · Updated 2 years ago
- ☆82 · Updated 3 years ago
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆144 · Updated 2 years ago
- Channel-wise Distillation for Semantic Segmentation ☆75 · Updated 3 years ago
- AlignSeg: Feature-Aligned Segmentation Networks (TPAMI 2021) ☆128 · Updated 2 years ago
- [ICLR'22] An official implementation of "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆124 · Updated 2 years ago
- Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning ☆96 · Updated last year
- Official implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers" ☆46 · Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction) ☆61 · Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022) ☆93 · Updated 2 years ago
- Official code for dynamic convolution decomposition ☆130 · Updated 3 years ago
- ☆56 · Updated 3 years ago
- RF-Next: Efficient Receptive Field Search for CNN (TPAMI 2022, CVPR 2021). Try it, you won't regret it! ☆63 · Updated last year
- Implementation of Convolutional enhanced image Transformer ☆101 · Updated 3 years ago
- Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation (ECCV 2020 Oral) ☆66 · Updated 4 years ago
- A knowledge distillation toolbox based on mmsegmentation. ☆43 · Updated last year