yitu-opensource/T2T-ViT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yitu-opensource/T2T-ViT)

yitu-opensource / T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

☆1,194

Alternatives and similar repositories for T2T-ViT

Users that are interested in T2T-ViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,359Mar 15, 2024Updated 2 years ago
whai362 / PVT
View on GitHub
Official implementation of PVT series
☆1,901Oct 27, 2022Updated 3 years ago
zihangJiang / TokenLabeling
View on GitHub
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆436Sep 5, 2023Updated 2 years ago
sail-sg / volo
View on GitHub
VOLO: Vision Outlooker for Visual Recognition
☆948Sep 18, 2022Updated 3 years ago
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,008Jul 24, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
dk-liang / Awesome-Visual-Transformer
View on GitHub
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,590Jan 7, 2025Updated last year
lucidrains / bottleneck-transformer-pytorch
View on GitHub
Implementation of Bottleneck Transformer in Pytorch
☆677Sep 20, 2021Updated 4 years ago
blackfeather-wang / Dynamic-Vision-Transformer
View on GitHub
Accelerating T2t-ViT by 1.6-3.6x.
☆260Nov 25, 2021Updated 4 years ago
google-research / vision_transformer
View on GitHub
☆12,637Jul 9, 2026Updated last week
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,430Jun 22, 2026Updated last month
Meituan-AutoML / Twins
View on GitHub
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611Feb 14, 2023Updated 3 years ago
fudan-zvg / SETR
View on GitHub
[CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
☆1,108Sep 2, 2024Updated last year
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,003Updated this week
lucidrains / lambda-networks
View on GitHub
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
☆1,528Nov 18, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
fundamentalvision / Deformable-DETR
View on GitHub
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
☆4,002May 16, 2024Updated 2 years ago
d-li14 / involution
View on GitHub
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311Jul 16, 2021Updated 5 years ago
raoyongming / DynamicViT
View on GitHub
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆668Jul 11, 2023Updated 3 years ago
DingXiaoH / RepVGG
View on GitHub
RepVGG: Making VGG-style ConvNets Great Again
☆3,479Feb 10, 2023Updated 3 years ago
facebookresearch / detr
View on GitHub
End-to-End Object Detection with Transformers
☆15,352Mar 12, 2024Updated 2 years ago
facebookresearch / LeViT
View on GitHub
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
☆624Aug 27, 2022Updated 3 years ago
jeonsworld / ViT-pytorch
View on GitHub
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,158Jun 7, 2022Updated 4 years ago
Meituan-AutoML / CPVT
View on GitHub
☆196Feb 14, 2023Updated 3 years ago
szq0214 / MEAL-V2
View on GitHub
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks. In NeurIPS 2020 workshop.
☆701Dec 24, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sail-sg / poolformer
View on GitHub
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363Jun 1, 2024Updated 2 years ago
zhanghang1989 / ResNeSt
View on GitHub
ResNeSt: Split-Attention Networks
☆3,262Dec 9, 2022Updated 3 years ago
facebookresearch / moco
View on GitHub
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
☆5,138Feb 3, 2026Updated 5 months ago
VITA-Group / TransGAN
View on GitHub
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
☆1,693Nov 3, 2022Updated 3 years ago
facebookresearch / ConvNeXt
View on GitHub
Code release for ConvNeXt model
☆6,414Jan 8, 2023Updated 3 years ago
naver-ai / pit
View on GitHub
☆245Jul 23, 2021Updated 5 years ago
hszhao / SAN
View on GitHub
Exploring Self-attention for Image Recognition, CVPR2020.
☆751Jun 15, 2020Updated 6 years ago
facebookresearch / xcit
View on GitHub
Official code Cross-Covariance Image Transformer (XCiT)
☆681Sep 28, 2021Updated 4 years ago
microsoft / SimMIM
View on GitHub
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047Sep 29, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / MaskFormer
View on GitHub
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,462Mar 11, 2022Updated 4 years ago
hila-chefer / Transformer-Explainability
View on GitHub
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …
☆2,007Jan 24, 2024Updated 2 years ago
facebookresearch / convit
View on GitHub
Code for the Convolutional Vision Transformer (ConViT)
☆474Oct 25, 2021Updated 4 years ago
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,368Jul 23, 2024Updated last year
houqb / VisionPermutator
View on GitHub
MLP-Like Vision Permutator for Visual Recognition (PyTorch)
☆192Mar 31, 2022Updated 4 years ago
ofsoundof / LocalViT
View on GitHub
☆118Jan 17, 2026Updated 6 months ago
WXinlong / DenseCL
View on GitHub
Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.
☆570Dec 26, 2023Updated 2 years ago