xxxnell / how-do-vits-workLinks

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

☆820

Alternatives and similar repositories for how-do-vits-work

Users that are interested in how-do-vits-work are comparing it to the libraries listed below

Sorting:

locuslab / convmixer
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
☆1,078Updated 3 years ago
lucidrains / mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
☆1,053Updated 4 months ago
SHI-Labs / Compact-Transformers
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
☆537Updated last year
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,356Updated last year
facebookresearch / convit
Code for the Convolutional Vision Transformer (ConViT)
☆470Updated 4 years ago
SHI-Labs / Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
☆1,160Updated last year
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,008Updated 3 years ago
microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆587Updated 2 years ago
microsoft / esvit
EsViT: Efficient self-supervised Vision Transformers
☆411Updated 2 years ago
katsura-jp / pytorch-cosine-annealing-with-warmup
☆466Updated 2 years ago
microsoft / Focal-Transformer
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
☆559Updated 3 years ago
jacobgil / vit-explain
Explainability for Vision Transformers
☆1,019Updated 3 years ago
facebookresearch / msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆463Updated 3 years ago
SwinTransformer / Transformer-SSL
This is an official implementation for "Self-Supervised Learning with Swin Transformers".
☆665Updated 4 years ago
Alibaba-MIIL / ImageNet21K
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
☆776Updated 2 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,190Updated 2 years ago
chinhsuanwu / coatnet-pytorch
A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"
☆392Updated 4 years ago
hila-chefer / Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…
☆878Updated 2 years ago
EPFL-VILAB / MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
☆608Updated 2 years ago
DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,336Updated 2 years ago
sail-sg / Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
☆803Updated 5 months ago
Alpha-VL / ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
☆519Updated 2 years ago
bytedance / ibot
iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
☆754Updated 3 years ago
NVlabs / FAN
Official PyTorch implementation of Fully Attentional Networks
☆481Updated 2 years ago
facebookresearch / moco-v3
PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057
☆1,305Updated 4 years ago
NVlabs / GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
☆441Updated last year
The-AI-Summer / self-attention-cv
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
☆1,211Updated 4 years ago
google-research / maxvit
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…
☆489Updated 2 years ago
fadel / pytorch_ema
Tiny PyTorch library for maintaining a moving average of a collection of parameters.
☆439Updated last year
Sense-X / UniFormer
[ICLR2022] official implementation of UniFormer
☆889Updated last year