Meituan-AutoML / CPVT
☆192 Updated last year
Alternatives and similar repositories for CPVT:
Users who are interested in CPVT are comparing it to the repositories listed below.
- Official code for the paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" (ICLR 2022 Spotlight) ☆184 Updated 2 years ago
- ☆211 Updated 3 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch) ☆191 Updated 2 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆282 Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆126 Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆150 Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆281 Updated 2 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer ☆115 Updated last year
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning ☆286 Updated 2 years ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention ☆115 Updated 2 years ago
- This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning" ☆193 Updated 2 years ago
- ☆119 Updated 2 years ago
- Lite Vision Transformer (CVPR 2022) ☆136 Updated 2 years ago
- Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021 ☆336 Updated 3 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021. ☆150 Updated 3 years ago
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks ☆215 Updated 3 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers. ☆156 Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality" ☆242 Updated 2 years ago
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022) ☆82 Updated 2 years ago
- Reproduction of semantic segmentation using masked autoencoder (MAE) ☆160 Updated 2 years ago
- ☆108 Updated 3 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆490 Updated last year
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones ☆198 Updated 3 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'. ☆173 Updated 2 years ago
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆154 Updated 2 years ago
- iFormer: Inception Transformer ☆243 Updated 2 years ago
- Accelerating T2T-ViT by 1.6-3.6x. ☆248 Updated 3 years ago
- PyTorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers" ☆426 Updated last year
- ☆98 Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention" ☆260 Updated last year