google-research / vision_transformerLinks
☆11,460Updated 3 months ago
Alternatives and similar repositories for vision_transformer
Users that are interested in vision_transformer are comparing it to the libraries listed below
Sorting:
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆14,911Updated 10 months ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆34,507Updated this week
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆23,145Updated 3 months ago
- Code release for ConvNeXt model☆6,027Updated 2 years ago
- Official DeiT repository☆4,216Updated last year
- End-to-End Object Detection with Transformers☆14,430Updated last year
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,998Updated last month
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,855Updated 10 months ago
- A PyTorch implementation of EfficientNet☆8,119Updated 3 years ago
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,493Updated 5 months ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆6,909Updated 11 months ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆8,966Updated 10 months ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,041Updated 3 years ago
- An open source implementation of CLIP.☆11,957Updated last week
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.☆4,240Updated 3 months ago
- SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners☆4,299Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,654Updated last year
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,242Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆29,407Updated 10 months ago
- OpenMMLab Pre-training Toolbox and Benchmark☆3,684Updated 7 months ago
- OpenMMLab Computer Vision Foundation☆6,160Updated last month
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆11,783Updated 2 months ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,266Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,686Updated this week
- 🐍 Geometric Computer Vision Library for Spatial AI☆10,537Updated last week
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆10,577Updated this week
- RepVGG: Making VGG-style ConvNets Great Again☆3,394Updated 2 years ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆8,991Updated last month
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,566Updated last week
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆6,984Updated 6 months ago