wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆282 · Updated 2 years ago
Alternatives and similar repositories for ResT:
Users who are interested in ResT are comparing it to the libraries listed below.
- ☆211 · Updated 3 years ago
- ☆191 · Updated last year
- Official MegEngine implementation of RepLKNet ☆273 · Updated 2 years ago
- Lite Vision Transformer (CVPR 2022) ☆137 · Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows, CVPR 2022 ☆556 · Updated last year
- Bottleneck Transformers for Visual Recognition ☆275 · Updated 3 years ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding ☆205 · Updated last year
- Two simple and effective vision transformer designs that are on par with the Swin Transformer ☆595 · Updated last year
- Implementation of Convolutional enhanced image Transformer ☆102 · Updated 3 years ago
- The official implementation of the CVPR 2021 paper: Decoupled Dynamic Filter Networks ☆215 · Updated 3 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆549 · Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆151 · Updated 3 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers" ☆349 · Updated 2 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition". ☆527 · Updated 3 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022]. ☆189 · Updated 2 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions ☆328 · Updated last year
- [ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer" ☆343 · Updated 11 months ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression" ☆251 · Updated 3 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch) ☆191 · Updated 2 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight ☆184 · Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆126 · Updated 2 years ago
- ☆191 · Updated 2 years ago
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆154 · Updated 2 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆282 · Updated 2 years ago
- ☆171 · Updated 3 weeks ago
- [ICCV 2021] Code for approximated exponential maximum pooling ☆292 · Updated 2 years ago
- [ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction ☆203 · Updated 3 years ago
- [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction ☆370 · Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆119 · Updated 3 years ago
- [T-IP 2023] Code for exponential adaptive pooling for PyTorch ☆81 · Updated last year