lucidrains / vit-pytorchLinks
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
☆23,314Updated 4 months ago
Alternatives and similar repositories for vit-pytorch
Users that are interested in vit-pytorch are comparing it to the libraries listed below
Sorting:
- ☆11,545Updated 4 months ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆14,979Updated 11 months ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆34,654Updated this week
- End-to-End Object Detection with Transformers☆14,495Updated last year
- Code release for ConvNeXt model☆6,054Updated 2 years ago
- Official DeiT repository☆4,230Updated last year
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆11,894Updated 3 months ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,028Updated 11 months ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,888Updated 11 months ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆6,952Updated last year
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,500Updated 6 months ago
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,045Updated 2 weeks ago
- OpenMMLab Pre-training Toolbox and Benchmark☆3,708Updated 8 months ago
- A PyTorch implementation of EfficientNet☆8,132Updated 3 years ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,018Updated 2 weeks ago
- OpenMMLab Detection Toolbox and Benchmark☆31,311Updated 10 months ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,055Updated 3 years ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆4,897Updated 11 months ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆29,736Updated 11 months ago
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆10,638Updated this week
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.☆4,257Updated 3 months ago
- OpenMMLab Computer Vision Foundation☆6,183Updated 2 months ago
- Datasets, Transforms and Models specific to Computer Vision☆16,970Updated this week
- Pytorch implementation of convolutional neural network visualization techniques☆8,043Updated 6 months ago
- PyTorch implementation of the U-Net for image semantic segmentation with high quality images☆10,325Updated 11 months ago
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,660Updated last year
- 🐍 Geometric Computer Vision Library for Spatial AI☆10,591Updated this week
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,270Updated 2 years ago
- SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners☆4,314Updated 2 years ago
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆12,010Updated 7 months ago