ChristophReich1996 / MaxViT
PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].
☆161Updated last year
Alternatives and similar repositories for MaxViT:
Users that are interested in MaxViT are comparing it to the libraries listed below
- Lite Vision Transformer (CVPR 2022)☆137Updated 2 years ago
- iFormer: Inception Transformer☆246Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆243Updated 2 years ago
- ☆191Updated 2 years ago
- ☆213Updated 3 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆184Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].☆189Updated 2 years ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆345Updated last year
- [T-IP 2023] Code for exponential adaptive pooling for PyTorch☆81Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆137Updated 2 years ago
- ☆119Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆267Updated last year
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆282Updated 2 years ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆157Updated 2 years ago
- Official MegEngine implementation of RepLKNet☆273Updated 2 years ago
- ☆197Updated 6 months ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆283Updated 2 years ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆116Updated 2 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)☆191Updated 2 years ago
- Code and models for mobile-former☆121Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆491Updated last year
- ☆170Updated last month
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆152Updated 3 years ago
- [NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆140Updated last month
- ☆191Updated 2 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆150Updated 3 years ago
- ☆249Updated 2 years ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers…☆264Updated last year
- Code Release for MViTv2 on Image Recognition.☆416Updated 2 months ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago