dk-liang/Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

☆3,589

Alternatives and similar repositories for Awesome-Visual-Transformer

Users interested in Awesome-Visual-Transformer are comparing it to the repositories listed below.

DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,344 · Aug 22, 2023 · Updated 2 years ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,447 · Jun 22, 2026 · Updated last month
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,019 · Jul 24, 2024 · Updated 2 years ago
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,024 · Updated this week
cmhungsteve / Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
☆5,051 · Jul 30, 2024 · Updated 2 years ago
Yangzhangcst / Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
☆1,460 · Nov 19, 2025 · Updated 8 months ago
facebookresearch / deit
Official DeiT repository
☆4,357 · Mar 15, 2024 · Updated 2 years ago
google-research / vision_transformer
☆12,650 · Updated this week
IDEA-Research / awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
☆1,399 · Jul 4, 2024 · Updated 2 years ago
facebookresearch / detr
End-to-End Object Detection with Transformers
☆15,361 · Mar 12, 2024 · Updated 2 years ago
whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194 · Oct 27, 2023 · Updated 2 years ago
jason718 / awesome-self-supervised-learning
A curated list of awesome self-supervised methods
☆6,407 · Feb 24, 2026 · Updated 5 months ago
lijiaman / awesome-transformer-for-vision
☆280 · Mar 22, 2021 · Updated 5 years ago
amusi / CVPR2026-Papers-with-Code
A collection of CVPR 2026 papers and open-source projects
☆22,771 · Mar 8, 2026 · Updated 4 months ago
fundamentalvision / Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
☆4,006 · May 16, 2024 · Updated 2 years ago
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,417 · Jan 8, 2023 · Updated 3 years ago
extreme-assistant / CVPR2024-Paper-Code-Interpretation
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 collection of papers, code, interpretations, and livestreams, compiled by the Jishi (极市) team
☆12,479 · Apr 25, 2024 · Updated 2 years ago
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,375 · Jul 23, 2024 · Updated 2 years ago
alohays / awesome-visual-representation-learning-with-transformers
Awesome Transformers (self-attention) in Computer Vision
☆271 · Jul 31, 2021 · Updated 4 years ago
open-mmlab / mmdetection
OpenMMLab Detection Toolbox and Benchmark
☆32,852 · Aug 21, 2024 · Updated last year
diff-usion / Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
☆12,365 · Aug 1, 2024 · Updated last year
yuewang-cuhk / awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
☆1,159 · Aug 19, 2022 · Updated 3 years ago
jacobgil / pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…
☆12,938 · Jul 10, 2026 · Updated 2 weeks ago
fudan-zvg / SETR
[CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
☆1,108 · Sep 2, 2024 · Updated last year
facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,610 · Jul 3, 2024 · Updated 2 years ago
open-mmlab / mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
☆3,302 · Jun 25, 2023 · Updated 3 years ago
pengzhiliang / MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
☆2,691 · Jul 25, 2023 · Updated 3 years ago
zhaoxin94 / awesome-domain-adaptation
A collection of AWESOME things about domain adaptation
☆5,452 · Dec 8, 2025 · Updated 7 months ago
52CV / CVPR-2021-Papers
☆2,534 · Apr 11, 2022 · Updated 4 years ago
facebookresearch / moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
☆5,139 · Feb 3, 2026 · Updated 5 months ago
yzhuoning / Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,229 · Jun 28, 2024 · Updated 2 years ago
facebookresearch / MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,462 · Mar 11, 2022 · Updated 4 years ago
d-li14 / involution
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311 · Jul 16, 2021 · Updated 5 years ago
Lyken17 / pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
☆5,078 · Jul 8, 2024 · Updated 2 years ago
awesome-NeRF / awesome-NeRF
A curated list of awesome neural radiance fields papers
☆6,777 · Jan 6, 2025 · Updated last year
raoyongming / DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆668 · Jul 11, 2023 · Updated 3 years ago
facebookresearch / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,393 · Mar 16, 2026 · Updated 4 months ago
jeonsworld / ViT-pytorch
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,159 · Jun 7, 2022 · Updated 4 years ago