tahmid0007 / VisualTransformers
A PyTorch implementation of the paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision"
☆182 Updated 4 years ago
Alternatives and similar repositories for VisualTransformers
Users that are interested in VisualTransformers are comparing it to the libraries listed below
- A complete easy to follow implementation of Google's Vision Transformer proposed in "AN IMAGE IS WORTH 16X16 WORDS". This pytorch impleme… ☆96 Updated 4 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers ☆225 Updated 3 years ago
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones ☆199 Updated 4 years ago
- ☆245 Updated 3 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch) ☆190 Updated 3 years ago
- A Pytorch implementation for the paper Local Relational Networks for Image Recognition (https://arxiv.org/pdf/1904.11491.pdf) ☆113 Updated 3 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc… ☆304 Updated 3 years ago
- Code for “Disentangled Non-local Neural Networks” ☆108 Updated 4 years ago
- Deforming kernels to adapt towards object deformation. In ICLR 2020. ☆198 Updated 5 years ago
- Pytorch version of Vision Transformer (ViT) with pretrained models. This is part of CASL (https://casl-project.github.io/) and ASYML proj… ☆352 Updated 4 years ago
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale ☆295 Updated 3 years ago
- ☆245 Updated 3 years ago
- ☆98 Updated 3 years ago
- ☆138 Updated 3 years ago
- This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight) ☆453 Updated 3 years ago
- ☆190 Updated 2 years ago
- Resolution adaptive network ☆152 Updated 2 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021. ☆153 Updated 3 years ago
- Code for ECCV 2020 paper (oral): Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation ☆161 Updated 4 years ago
- Awesome Transformers (self-attention) in Computer Vision ☆270 Updated 3 years ago
- Disentangled Non-Local Neural Networks ☆81 Updated 4 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆156 Updated 3 years ago
- Bottleneck Transformers for Visual Recognition ☆278 Updated 4 years ago
- Accelerating T2t-ViT by 1.6-3.6x. ☆251 Updated 3 years ago
- [ECCV2020] Knowledge Distillation Meets Self-Supervision ☆236 Updated 2 years ago
- Global Reasoning module for visual recognition ☆205 Updated 3 years ago
- Self-supervised vIsion Transformer (SiT) ☆330 Updated 2 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper) ☆221 Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers" ☆427 Updated last year
- This repo contains the code of "ConTNet: Why not use convolution and transformer at the same time?" ☆99 Updated 3 years ago