google-research / vision_transformerLinks
☆11,649Updated 5 months ago
Alternatives and similar repositories for vision_transformer
Users that are interested in vision_transformer are comparing it to the libraries listed below
Sorting:
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,080Updated last year
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆23,549Updated last week
- Official DeiT repository☆4,246Updated last year
- Code release for ConvNeXt model☆6,085Updated 2 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆34,953Updated this week
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,039Updated last year
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,033Updated last month
- End-to-End Object Detection with Transformers☆14,590Updated last year
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,937Updated last year
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,069Updated 3 years ago
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.☆4,277Updated 4 months ago
- SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners☆4,337Updated 2 years ago
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,511Updated 7 months ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,025Updated 4 months ago
- A PyTorch implementation of EfficientNet☆8,142Updated 3 years ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,657Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,662Updated 2 years ago
- OpenMMLab Pre-training Toolbox and Benchmark☆3,729Updated 9 months ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,132Updated 11 months ago
- OpenMMLab Computer Vision Foundation☆6,216Updated 3 months ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,276Updated 2 years ago
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆10,768Updated this week
- Datasets, Transforms and Models specific to Computer Vision☆17,056Updated this week
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆4,913Updated last year
- An open source implementation of CLIP.☆12,360Updated this week
- PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations☆2,424Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,051Updated 8 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,626Updated this week
- Count the MACs / FLOPs of your PyTorch model.☆5,029Updated last year
- RepVGG: Making VGG-style ConvNets Great Again☆3,412Updated 2 years ago