facebookresearch / LeViT

LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference

☆616

Alternatives and similar repositories for LeViT

Users who are interested in LeViT are comparing it to the libraries listed below.

megvii-model / RepVGG
☆321 Updated 3 years ago
microsoft / Focal-Transformer
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
☆556 Updated 3 years ago
microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆581 Updated 2 years ago
zihangJiang / TokenLabeling
PyTorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆432 Updated last year
NVlabs / FAN
Official PyTorch implementation of Fully Attentional Networks
☆479 Updated 2 years ago
chinhsuanwu / mobilevit-pytorch
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
☆538 Updated 3 years ago
DingXiaoH / RepLKNet-pytorch
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)
☆923 Updated last year
Meituan-AutoML / Twins
Two simple and effective vision transformer designs that are on par with the Swin transformer
☆604 Updated 2 years ago
raoyongming / DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆618 Updated 2 years ago
DingXiaoH / RepMLP
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)
☆306 Updated 2 years ago
VITA-Group / SLaK
[ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers…
☆275 Updated 2 years ago
MegEngine / RepLKNet
Official MegEngine implementation of RepLKNet
☆277 Updated 3 years ago
imankgoyal / NonDeepNetworks
Official Code for "Non-deep Networks"
☆585 Updated 2 years ago
leoxiaobin / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆228 Updated 3 years ago
facebookresearch / convit
Code for the Convolutional Vision Transformer (ConViT)
☆466 Updated 3 years ago
microsoft / CSWin-Transformer
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows, CVPR 2022
☆577 Updated last year
snap-research / EfficientFormer
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]
☆1,061 Updated last year
mmaaz60 / EdgeNeXt
[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…
☆384 Updated 2 years ago
mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆233 Updated 3 years ago
microsoft / DynamicHead
☆648 Updated 2 years ago
wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆287 Updated 2 years ago
sail-sg / metaformer
MetaFormer Baselines for Vision (TPAMI 2024)
☆477 Updated last year
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,347 Updated last year
dingmyu / davit
[ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer"
☆361 Updated last year
raoyongming / HorNet
[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
☆339 Updated last year
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆289 Updated 3 years ago
JDAI-CV / CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
☆534 Updated 3 years ago
hkzhang-git / ParC-Net
[ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
☆357 Updated 2 years ago
DingXiaoH / DiverseBranchBlock
Diverse Branch Block: Building a Convolution as an Inception-like Unit
☆340 Updated 2 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,187 Updated last year