wofmanaf / ResTLinks
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆287Updated 2 years ago
Alternatives and similar repositories for ResT
Users that are interested in ResT are comparing it to the libraries listed below
Sorting:
- Official MegEngine implementation of RepLKNet☆277Updated 3 years ago
- ☆215Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆604Updated 2 years ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆162Updated 3 years ago
- Bottleneck Transformers for Visual Recognition☆279Updated 4 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆213Updated last month
- ☆195Updated 2 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆116Updated last year
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks☆219Updated 3 months ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆339Updated last year
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆577Updated last year
- Lite Vision Transformer (CVPR 2022)☆144Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers☆120Updated 3 years ago
- ☆192Updated 2 years ago
- Implementation of Convolutional enhanced image Transformer☆105Updated 4 years ago
- [ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction☆206Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆289Updated 3 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆228Updated 3 years ago
- [ICCV 2021] Code for approximated exponential maximum pooling☆297Updated 2 years ago
- [CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Prediction☆379Updated 3 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆156Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆534Updated 3 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆357Updated 2 years ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"☆256Updated 4 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆186Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆163Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".☆126Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆140Updated 3 years ago
- ☆183Updated 7 months ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)☆199Updated 2 years ago