SHI-Labs / Compact-Transformers
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
β515Updated 3 months ago
Alternatives and similar repositories for Compact-Transformers:
Users that are interested in Compact-Transformers are comparing it to the libraries listed below
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.β569Updated last year
- Implementation of ConvMixer for "Patches Are All You Need? π€·"β1,065Updated 2 years ago
- Code for the Convolutional Vision Transformer (ConViT)β467Updated 3 years ago
- EsViT: Efficient self-supervised Vision Transformersβ411Updated last year
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"β549Updated 2 years ago
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".β643Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)β1,315Updated 9 months ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classificationβ466Updated last year
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"β345Updated last year
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"β809Updated 2 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformersβ227Updated 3 years ago
- An All-MLP solution for Vision, from Google AIβ1,015Updated 5 months ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachersβ¦β265Updated last year
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"β427Updated last year
- LeViT a Vision Transformer in ConvNet's Clothing for Faster Inferenceβ608Updated 2 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)β483Updated 3 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsificationβ595Updated last year
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022β1,090Updated 9 months ago
- β584Updated 4 months ago
- PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057β1,247Updated 3 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paperβ747Updated 2 years ago
- Implementation of the Swin Transformer in PyTorch.β819Updated 3 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNetβ1,179Updated last year
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"β284Updated 2 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)β444Updated 9 months ago
- Self-supervised vIsion Transformer (SiT)β326Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".β955Updated 2 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformersβ231Updated 3 years ago
- Implementation of Bottleneck Transformer in Pytorchβ676Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networksβ476Updated last year