linhezheng19 / CAT
Official implement of "CAT: Cross Attention in Vision Transformer".
☆159Updated 2 years ago
Alternatives and similar repositories for CAT:
Users that are interested in CAT are comparing it to the libraries listed below
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆156Updated 3 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆116Updated last year
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆209Updated last year
- ☆215Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆283Updated 2 years ago
- Lite Vision Transformer (CVPR 2022)☆142Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers☆119Updated 3 years ago
- Implementation of Convolutional enhanced image Transformer☆104Updated 4 years ago
- Vision Transformers with Hierarchical Attention☆100Updated 7 months ago
- ☆50Updated 3 years ago
- ☆190Updated 2 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆139Updated 2 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆153Updated 3 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".☆126Updated 2 years ago
- The official pytorch implementation of ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias☆104Updated 3 years ago
- iFormer: Inception Transformer☆247Updated 2 years ago
- ☆60Updated 3 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆199Updated 4 years ago
- ☆142Updated 8 months ago
- Pytorch Re-Implementation | Dynamic Region-Aware Convolution (ECCV2020)☆104Updated 3 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Updated 3 years ago
- ☆84Updated last year
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆250Updated 2 years ago
- Simple implementation of Mobile-Former on Pytorch☆108Updated 3 years ago
- ☆176Updated 4 months ago
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks☆218Updated 2 weeks ago
- ☆119Updated 3 years ago
- MobileFormer in torch☆66Updated 3 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆184Updated 2 years ago