microsoft/CvT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/CvT)

microsoft / CvT

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

☆609

Alternatives and similar repositories for CvT

Users that are interested in CvT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

leoxiaobin / CvT
View on GitHub
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆229Jul 4, 2022Updated 4 years ago
rishikksh20 / convolution-vision-transformers
View on GitHub
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
☆226May 26, 2021Updated 5 years ago
rishikksh20 / CeiT-pytorch
View on GitHub
Implementation of Convolutional enhanced image Transformer
☆106Mar 27, 2021Updated 5 years ago
facebookresearch / convit
View on GitHub
Code for the Convolutional Vision Transformer (ConViT)
☆474Oct 25, 2021Updated 4 years ago
whai362 / PVT
View on GitHub
Official implementation of PVT series
☆1,902Oct 27, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,358Mar 15, 2024Updated 2 years ago
yitu-opensource / T2T-ViT
View on GitHub
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194Oct 27, 2023Updated 2 years ago
microsoft / vision-longformer
View on GitHub
☆249Mar 16, 2022Updated 4 years ago
mlpc-ucsd / CoaT
View on GitHub
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆236Feb 3, 2022Updated 4 years ago
facebookresearch / ConvNeXt
View on GitHub
Code release for ConvNeXt model
☆6,415Jan 8, 2023Updated 3 years ago
wofmanaf / ResT
View on GitHub
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291Sep 28, 2022Updated 3 years ago
microsoft / Focal-Transformer
View on GitHub
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
☆559Mar 27, 2022Updated 4 years ago
zihangJiang / TokenLabeling
View on GitHub
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆436Sep 5, 2023Updated 2 years ago
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,003Jul 24, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,423Jun 22, 2026Updated 3 weeks ago
sail-sg / poolformer
View on GitHub
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363Jun 1, 2024Updated 2 years ago
SHI-Labs / Compact-Transformers
View on GitHub
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
☆546Nov 5, 2024Updated last year
google-research / vision_transformer
View on GitHub
☆12,631Jul 9, 2026Updated last week
Sara-Ahmed / SiT
View on GitHub
Self-supervised vIsion Transformer (SiT)
☆335Dec 24, 2022Updated 3 years ago
sail-sg / volo
View on GitHub
VOLO: Vision Outlooker for Visual Recognition
☆948Sep 18, 2022Updated 3 years ago
microsoft / SimMIM
View on GitHub
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047Sep 29, 2022Updated 3 years ago
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,993Updated this week
facebookresearch / LeViT
View on GitHub
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
☆624Aug 27, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dk-liang / Awesome-Visual-Transformer
View on GitHub
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,587Jan 7, 2025Updated last year
facebookresearch / FixRes
View on GitHub
This repository reproduces the results of the paper: "Fixing the train-test resolution discrepancy" https://arxiv.org/abs/1906.06423
☆1,043Aug 11, 2021Updated 4 years ago
raoyongming / DynamicViT
View on GitHub
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆668Jul 11, 2023Updated 3 years ago
zhoudaquan / Refiner_ViT
View on GitHub
☆110Sep 15, 2021Updated 4 years ago
huawei-noah / Efficient-AI-Backbones
View on GitHub
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
☆4,416Mar 15, 2025Updated last year
naver-ai / pit
View on GitHub
☆245Jul 23, 2021Updated 4 years ago
d-li14 / involution
View on GitHub
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311Jul 16, 2021Updated 5 years ago
JDAI-CV / CoTNet
View on GitHub
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
☆538Aug 8, 2021Updated 4 years ago
microsoft / FocalNet
View on GitHub
[NeurIPS 2022] Official code for "Focal Modulation Networks"
☆749Nov 7, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Meituan-AutoML / Twins
View on GitHub
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611Feb 14, 2023Updated 3 years ago
apple / ml-cvnets
View on GitHub
CVNets: A library for training computer vision networks
☆1,975Oct 30, 2023Updated 2 years ago
jeonsworld / ViT-pytorch
View on GitHub
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,158Jun 7, 2022Updated 4 years ago
microsoft / Cream
View on GitHub
This is a collection of our NAS and Vision Transformer work.
☆1,836Jul 25, 2024Updated last year
DingXiaoH / RepVGG
View on GitHub
RepVGG: Making VGG-style ConvNets Great Again
☆3,479Feb 10, 2023Updated 3 years ago
ChengyueGongR / PatchVisionTransformer
View on GitHub
☆74Dec 8, 2022Updated 3 years ago
szq0214 / MEAL-V2
View on GitHub
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks. In NeurIPS 2020 workshop.
☆700Dec 24, 2021Updated 4 years ago