mlpc-ucsd / CoaTLinks
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
β233Updated 3 years ago
Alternatives and similar repositories for CoaT
Users that are interested in CoaT are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"β556Updated 3 years ago
- Implementation of the π Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbonesβ199Updated 4 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"β289Updated 3 years ago
- ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)β208Updated last year
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.β155Updated 2 years ago
- [Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021β167Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".β287Updated 2 years ago
- β248Updated 3 years ago
- β199Updated last year
- β139Updated 3 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.β228Updated 3 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"β432Updated last year
- Self-supervised vIsion Transformer (SiT)β337Updated 2 years ago
- [CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Predictionβ379Updated 3 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformersβ226Updated 4 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorcβ¦β305Updated 3 years ago
- Code for the Convolutional Vision Transformer (ConViT)β466Updated 3 years ago
- EsViT: Efficient self-supervised Vision Transformersβ413Updated last year
- β192Updated 2 years ago
- Official PyTorch implementation of Fully Attentional Networksβ479Updated 2 years ago
- β134Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"β242Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].β163Updated 2 years ago
- [TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformersβ486Updated 2 years ago
- A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of βCβ¦β236Updated 3 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)β199Updated 2 years ago
- Boundary IoU API (Beta version)β226Updated 11 months ago
- This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight)β457Updated 4 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)β191Updated 3 years ago
- SOTR: Segmenting Objects with Transformersβ193Updated 3 years ago