Vision Transformer (ViT) in PyTorch
☆847Mar 2, 2022Updated 4 years ago
Alternatives and similar repositories for PyTorch-Pretrained-ViT
Users that are interested in PyTorch-Pretrained-ViT are comparing it to the libraries listed below
Sorting:
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,123Jun 7, 2022Updated 3 years ago
- ☆12,332Updated this week
- Pytorch version of Vision Transformer (ViT) with pretrained models. This is part of CASL (https://casl-project.github.io/) and ASYML proj…☆360Nov 23, 2020Updated 5 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,420Feb 26, 2026Updated last week
- Official DeiT repository☆4,325Mar 15, 2024Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,721Jul 24, 2024Updated last year
- Explainability for Vision Transformers☆1,068Mar 12, 2022Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,976Jan 24, 2024Updated 2 years ago
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,565Jan 7, 2025Updated last year
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,230Jul 23, 2024Updated last year
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,192Oct 27, 2023Updated 2 years ago
- Recent Transformer-based CV and related works.☆1,340Aug 22, 2023Updated 2 years ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,459Jul 3, 2024Updated last year
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,116Feb 3, 2026Updated last month
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,643Apr 7, 2025Updated 10 months ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆729Aug 8, 2023Updated 2 years ago
- Official repository for "Self-Distilled Vision Transformer for Domain Generalization" (ACCV-2022 ORAL)☆42Dec 2, 2022Updated 3 years ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,296Jun 25, 2023Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,026Sep 29, 2022Updated 3 years ago
- [CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers☆757Jul 15, 2021Updated 4 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,707Feb 18, 2026Updated 2 weeks ago
- End-to-End Object Detection with Transformers☆15,134Mar 12, 2024Updated last year
- A Pytorch-Lightning implementation of self-supervised algorithms☆544Apr 13, 2022Updated 3 years ago
- Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization☆585Feb 15, 2023Updated 3 years ago
- Code release for ConvNeXt model☆6,302Jan 8, 2023Updated 3 years ago
- Official implementation of PVT series☆1,888Oct 27, 2022Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,687Jul 25, 2023Updated 2 years ago
- ☆43Jun 1, 2023Updated 2 years ago
- RepVGG: Making VGG-style ConvNets Great Again☆3,459Feb 10, 2023Updated 3 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆433Sep 5, 2023Updated 2 years ago
- PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)☆3,412Dec 26, 2023Updated 2 years ago
- ☆23Oct 29, 2020Updated 5 years ago
- Exploring Self-attention for Image Recognition, CVPR2020.☆752Jun 15, 2020Updated 5 years ago
- Vision Longformer For Object Detection☆34May 17, 2021Updated 4 years ago
- SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners☆4,458May 22, 2023Updated 2 years ago
- ☆14May 26, 2023Updated 2 years ago
- Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).☆122Dec 3, 2022Updated 3 years ago
- A PyTorch implementation of EfficientNet☆8,221Apr 8, 2022Updated 3 years ago