FrancescoSaverioZuppichini / ViTLinks

Implementing Vi(sion)T(transformer)

☆431

Alternatives and similar repositories for ViT

Users that are interested in ViT are comparing it to the libraries listed below

Sorting:

lukemelas / PyTorch-Pretrained-ViT
Vision Transformer (ViT) in PyTorch
☆844Updated 3 years ago
jeonsworld / ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,066Updated 3 years ago
jacobgil / vit-explain
Explainability for Vision Transformers
☆986Updated 3 years ago
berniwal / swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
☆842Updated 4 years ago
IcarusWizard / MAE
PyTorch implementation of Masked Autoencoder
☆265Updated 2 years ago
Yangzhangcst / Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
☆1,337Updated this week
DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,334Updated last year
frgfm / torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-…
☆2,232Updated last week
kentaroy47 / vision-transformers-cifar10
Let's train vision transformers (ViT) for cifar 10 / cifar 100!
☆662Updated last month
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,347Updated last year
IBM / CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
☆399Updated 3 years ago
luo3300612 / Visualizer
assistant tools for attention visualization in deep learning
☆1,197Updated 3 years ago
hila-chefer / Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …
☆1,913Updated last year
lucidrains / mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
☆1,034Updated 3 weeks ago
czczup / ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,388Updated 2 months ago
The-AI-Summer / self-attention-cv
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
☆1,210Updated 3 years ago
xxxnell / how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆819Updated 3 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,190Updated last year
microsoft / Cream
This is a collection of our NAS and Vision Transformer work.
☆1,785Updated last year
KMnP / vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
☆1,143Updated last year
facebookresearch / ConvNeXt-V2
Code release for ConvNeXt V2 model
☆1,802Updated 11 months ago
microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆581Updated 2 years ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆988Updated 2 years ago
chinhsuanwu / coatnet-pytorch
A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"
☆388Updated 3 years ago
sail-sg / metaformer
MetaFormer Baselines for Vision (TPAMI 2024)
☆477Updated last year
chinhsuanwu / mobilevit-pytorch
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
☆538Updated 3 years ago
DingXiaoH / RepLKNet-pytorch
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)
☆923Updated last year
whai362 / PVT
Official implementation of PVT series
☆1,835Updated 2 years ago
raoyongming / GFNet
[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification
☆484Updated 2 years ago
LeapLabTHU / DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…
☆887Updated last year