wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆282Updated 2 years ago
Alternatives and similar repositories for ResT:
Users that are interested in ResT are comparing it to the libraries listed below
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆205Updated last year
- ☆212Updated 3 years ago
- Bottleneck Transformers for Visual Recognition☆275Updated 3 years ago
- Official MegEngine implementation of RepLKNet☆273Updated 2 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆597Updated 2 years ago
- ☆191Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆557Updated last year
- [ICCV 2021] Code for approximated exponential maximum pooling☆292Updated 2 years ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"☆253Updated 3 years ago
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks☆215Updated 3 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆227Updated 2 years ago
- Implementation of Convolutional enhanced image Transformer☆103Updated 3 years ago
- Lite Vision Transformer (CVPR 2022)☆137Updated 2 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆115Updated last year
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆549Updated 2 years ago
- ☆191Updated 2 years ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆157Updated 2 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆327Updated last year
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆282Updated 2 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆527Updated 3 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)☆191Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆152Updated 3 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆364Updated 3 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers☆119Updated 3 years ago
- ☆171Updated last month
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆349Updated 2 years ago
- Code for our ICASSP 2021 paper: SA-Net: Shuffle Attention for Deep Convolutional Neural Networks☆255Updated 3 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆231Updated 3 years ago
- iFormer: Inception Transformer☆245Updated 2 years ago
- Simple implementation of Mobile-Former on Pytorch☆108Updated 3 years ago