wofmanaf / ResTLinks
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆287Updated 2 years ago
Alternatives and similar repositories for ResT
Users that are interested in ResT are comparing it to the libraries listed below
Sorting:
- Official MegEngine implementation of RepLKNet☆277Updated 3 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆340Updated last year
- ☆215Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆605Updated 2 years ago
- Bottleneck Transformers for Visual Recognition☆279Updated 4 years ago
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks☆219Updated 4 months ago
- Lite Vision Transformer (CVPR 2022)☆144Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers☆120Updated 3 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆115Updated last year
- Official implement of "CAT: Cross Attention in Vision Transformer".☆162Updated 3 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆213Updated 2 months ago
- Implementation of Convolutional enhanced image Transformer☆105Updated 4 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆581Updated last year
- [CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Prediction☆379Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆289Updated 3 years ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"☆256Updated 4 years ago
- ☆192Updated 2 years ago
- [ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction☆206Updated 3 years ago
- Simple implementation of Mobile-Former on Pytorch☆108Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆534Updated 4 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆227Updated 3 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆163Updated 2 years ago
- [ICCV 2021] Code for approximated exponential maximum pooling☆297Updated 2 years ago
- ☆195Updated 2 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆186Updated 2 years ago
- ☆199Updated last year
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆156Updated 4 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆357Updated 2 years ago
- RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)☆306Updated 2 years ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆362Updated last year