berniwal / swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
☆817Updated 3 years ago
Alternatives and similar repositories for swin-transformer-pytorch:
Users that are interested in swin-transformer-pytorch are comparing it to the libraries listed below
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,176Updated last year
- Official implementation of PVT series☆1,773Updated 2 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,312Updated 8 months ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆556Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation…☆1,202Updated 2 years ago
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆640Updated 3 years ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆891Updated 9 months ago
- [CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator☆1,309Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆597Updated 2 years ago
- PyTorch implementation of EfficientNetV2 family☆463Updated 3 years ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆826Updated 10 months ago
- [CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers☆1,060Updated 5 months ago
- Code for our CVPR2021 paper coordinate attention☆1,038Updated 3 years ago
- Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.☆1,192Updated 3 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆950Updated 2 years ago
- Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition☆561Updated 3 years ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆465Updated last year
- Code for ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks☆1,304Updated 3 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆566Updated last year
- ☆798Updated 2 years ago
- A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"☆518Updated 3 years ago
- [ICLR2022] official implementation of UniFormer☆845Updated 10 months ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆1,990Updated 2 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and …☆1,842Updated last year
- code and trained models for "Attentional Feature Fusion"☆763Updated 3 years ago
- Awesome List of Attention Modules and Plug&Play Modules in Computer Vision☆1,140Updated last year
- Official repository of ACmix (CVPR2022)☆404Updated 2 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆364Updated 3 years ago
- FcaNet: Frequency Channel Attention Networks☆529Updated 3 years ago
- A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"☆378Updated 3 years ago