ChristophReich1996 / Swin-Transformer-V2
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].
☆189 · Updated 2 years ago
Alternatives and similar repositories for Swin-Transformer-V2:
Users who are interested in Swin-Transformer-V2 are comparing it to the libraries listed below.
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆282 · Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows, CVPR 2022 ☆556 · Updated last year
- Lite Vision Transformer (CVPR 2022) ☆137 · Updated 2 years ago
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR 2022 ☆389 · Updated 2 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 ☆363 · Updated 3 years ago
- [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction ☆370 · Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022]. ☆161 · Updated last year
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions ☆328 · Updated last year
- [ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer" ☆343 · Updated 11 months ago
- ☆211 · Updated 3 years ago
- Official MegEngine implementation of RepLKNet ☆273 · Updated 2 years ago
- ☆171 · Updated 3 weeks ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression" ☆251 · Updated 3 years ago
- Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch ☆349 · Updated 2 years ago
- iFormer: Inception Transformer ☆245 · Updated 2 years ago
- [T-IP 2023] Code for exponential adaptive pooling for PyTorch ☆81 · Updated last year
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆490 · Updated last year
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers… ☆264 · Updated last year
- Code and models for mobile-former ☆119 · Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality" ☆243 · Updated 2 years ago
- Official ImageNet Model repository ☆241 · Updated last year
- ☆191 · Updated 2 years ago
- Official Pytorch Implementation of SegViT: Semantic Segmentation with Plain Vision Transformers ☆233 · Updated last year
- Repository of Vision Transformer with Deformable Attention (CVPR 2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Atte… ☆821 · Updated 9 months ago
- ☆245 · Updated 2 years ago
- MetaFormer Baselines for Vision (TPAMI 2024) ☆439 · Updated 7 months ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆549 · Updated 2 years ago
- ☆191 · Updated last year
- [ICLR 2024 poster] Varying Window Attention ☆119 · Updated 3 months ago
- [NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation ☆251 · Updated last year