lucidrains / transformer-in-transformerLinks

Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorch

☆305

Alternatives and similar repositories for transformer-in-transformer

Users that are interested in transformer-in-transformer are comparing it to the libraries listed below

Sorting:

lucidrains / halonet-pytorch
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199Updated 4 years ago
rishikksh20 / convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
☆226Updated 4 years ago
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆289Updated 3 years ago
microsoft / Focal-Transformer
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
☆556Updated 3 years ago
Sara-Ahmed / SiT
Self-supervised vIsion Transformer (SiT)
☆336Updated 2 years ago
facebookresearch / convit
Code for the Convolutional Vision Transformer (ConViT)
☆466Updated 3 years ago
microsoft / vision-longformer
☆247Updated 3 years ago
houqb / VisionPermutator
MLP-Like Vision Permutator for Visual Recognition (PyTorch)
☆191Updated 3 years ago
tahmid0007 / VisualTransformers
A Pytorch Implementation of the following paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision…
☆182Updated 4 years ago
leaderj1001 / LambdaNetworks
Implementing Lambda Networks using Pytorch
☆140Updated 4 years ago
zhoudaquan / dvit_repo
☆138Updated 3 years ago
gupta-abhay / pytorch-vit
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
☆296Updated 3 years ago
zihangJiang / TokenLabeling
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆431Updated last year
rishikksh20 / MLP-Mixer-pytorch
Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
☆218Updated 4 years ago
Meituan-AutoML / CPVT
☆191Updated 2 years ago
lucidrains / pixel-level-contrastive-learning
Implementation of Pixel-level Contrastive Learning, proposed in the paper "Propagate Yourself", in Pytorch
☆259Updated 4 years ago
microsoft / esvit
EsViT: Efficient self-supervised Vision Transformers
☆413Updated last year
mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆232Updated 3 years ago
naver-ai / pit
☆247Updated 3 years ago
cmsflash / efficient-attention
An implementation of the efficient attention module.
☆319Updated 4 years ago
csrhddlam / axial-deeplab
This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight)
☆456Updated 4 years ago
leoxiaobin / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆228Updated 3 years ago
lukemelas / do-you-even-need-attention
Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)
☆486Updated 4 years ago
SwinTransformer / Transformer-SSL
This is an official implementation for "Self-Supervised Learning with Swin Transformers".
☆657Updated 4 years ago
alohays / awesome-visual-representation-learning-with-transformers
Awesome Transformers (self-attention) in Computer Vision
☆271Updated 3 years ago
benjs / nfnets_pytorch
Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".
☆159Updated 4 years ago
blackfeather-wang / Dynamic-Vision-Transformer
Accelerating T2t-ViT by 1.6-3.6x.
☆252Updated 3 years ago
yhlleo / VTs-Drloc
[NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".
☆144Updated 6 months ago
lucidrains / res-mlp-pytorch
Implementation of ResMLP, an all MLP solution to image classification, in Pytorch
☆198Updated 2 years ago
wzlxjtu / PositionalEncoding2D
A PyTorch implementation of the 1d and 2d Sinusoidal positional encoding/embedding.
☆252Updated 4 years ago