mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆232 Updated 3 years ago
Alternatives and similar repositories for CoaT
Users who are interested in CoaT are comparing it to the repositories listed below.
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆289 Updated 3 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆558 Updated 3 years ago
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones ☆200 Updated 4 years ago
- [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction ☆379 Updated 3 years ago
- A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of “C… ☆237 Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆287 Updated 2 years ago
- ☆192 Updated 2 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers. ☆155 Updated 2 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers ☆226 Updated 4 years ago
- ☆248 Updated 3 years ago
- ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637) ☆208 Updated last year
- Bottleneck Transformers for Visual Recognition ☆279 Updated 4 years ago
- Boundary IoU API (Beta version) ☆227 Updated last year
- [Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021 ☆167 Updated 2 years ago
- ☆199 Updated last year
- Official PyTorch implementation of Fully Attentional Networks ☆479 Updated 2 years ago
- ☆134 Updated 2 years ago
- SOTR: Segmenting Objects with Transformers ☆193 Updated 3 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch) ☆192 Updated 3 years ago
- EsViT: Efficient self-supervised Vision Transformers ☆413 Updated last year
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers. ☆227 Updated 3 years ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers… ☆276 Updated 2 years ago
- Self-supervised vIsion Transformer (SiT) ☆338 Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022]. ☆163 Updated 2 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc… ☆305 Updated 3 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022) ☆200 Updated 2 years ago
- ☆139 Updated 3 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality" ☆243 Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers" ☆433 Updated last year
- Implementation of Pixel-level Contrastive Learning, proposed in the paper "Propagate Yourself", in Pytorch ☆260 Updated 4 years ago