lucidrains/vit-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/vit-pytorch)

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

☆25,413

Alternatives and similar repositories for vit-pytorch

Users that are interested in vit-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,986Updated this week
google-research / vision_transformer
View on GitHub
☆12,626Jul 9, 2026Updated last week
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆15,996Jul 24, 2024Updated last year
facebookresearch / detr
View on GitHub
End-to-End Object Detection with Transformers
☆15,336Mar 12, 2024Updated 2 years ago
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,357Mar 15, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,364Jul 23, 2024Updated last year
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆33,994Mar 25, 2026Updated 3 months ago
open-mmlab / mmdetection
View on GitHub
OpenMMLab Detection Toolbox and Benchmark
☆32,813Aug 21, 2024Updated last year
dk-liang / Awesome-Visual-Transformer
View on GitHub
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,587Jan 7, 2025Updated last year
jacobgil / pytorch-grad-cam
View on GitHub
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…
☆12,913Updated this week
facebookresearch / ConvNeXt
View on GitHub
Code release for ConvNeXt model
☆6,413Jan 8, 2023Updated 3 years ago
facebookresearch / dino
View on GitHub
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,600Jul 3, 2024Updated 2 years ago
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,550Sep 18, 2024Updated last year
jeonsworld / ViT-pytorch
View on GitHub
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,157Jun 7, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
facebookresearch / detectron2
View on GitHub
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
☆34,599Jun 7, 2026Updated last month
amusi / CVPR2026-Papers-with-Code
View on GitHub
CVPR 2026 论文和开源项目合集
☆22,748Mar 8, 2026Updated 4 months ago
facebookresearch / moco
View on GitHub
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
☆5,139Feb 3, 2026Updated 5 months ago
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆13,986Updated this week
arogozhnikov / einops
View on GitHub
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,550Jul 5, 2026Updated last week
lucidrains / denoising-diffusion-pytorch
View on GitHub
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
☆10,636Feb 11, 2026Updated 5 months ago
lucidrains / x-transformers
View on GitHub
A concise but complete full-attention transformer with a set of promising experimental features from various papers
☆5,919Jul 3, 2026Updated last week
huggingface / transformers
View on GitHub
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆162,626Updated this week
albumentations-team / albumentations
View on GitHub
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
☆15,317Jun 25, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Lightning-AI / pytorch-lightning
View on GitHub
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
☆31,235Updated this week
pengzhiliang / MAE-pytorch
View on GitHub
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
☆2,691Jul 25, 2023Updated 2 years ago
eriklindernoren / PyTorch-GAN
View on GitHub
PyTorch implementations of Generative Adversarial Networks.
☆17,462Jun 18, 2024Updated 2 years ago
lukemelas / EfficientNet-PyTorch
View on GitHub
A PyTorch implementation of EfficientNet
☆8,222Apr 8, 2022Updated 4 years ago
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,109Jun 3, 2026Updated last month
xmu-xiaoma666 / External-Attention-pytorch
View on GitHub
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…
☆12,179Mar 16, 2026Updated 4 months ago
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,159Jan 23, 2026Updated 5 months ago
kornia / kornia
View on GitHub
🐍 Geometric Computer Vision Library for Spatial AI
☆11,276Updated this week
open-mmlab / mmsegmentation
View on GitHub
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
☆9,877Aug 13, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
qubvel-org / segmentation_models.pytorch
View on GitHub
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
☆11,659Updated this week
labmlai / annotated_deep_learning_paper_implementations
View on GitHub
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, fee…
☆67,102Jan 22, 2026Updated 5 months ago
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,391Mar 16, 2026Updated 4 months ago
huggingface / diffusers
View on GitHub
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
☆34,064Updated this week
CompVis / latent-diffusion
View on GitHub
High-Resolution Image Synthesis with Latent Diffusion Models
☆14,104Feb 29, 2024Updated 2 years ago
pytorch / vision
View on GitHub
Datasets, Transforms and Models specific to Computer Vision
☆17,813Updated this week
diff-usion / Awesome-Diffusion-Models
View on GitHub
A collection of resources and papers on Diffusion Models
☆12,352Aug 1, 2024Updated last year