mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆234 · Updated 3 years ago
Alternatives and similar repositories for CoaT
Users who are interested in CoaT are comparing it to the libraries listed below.
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones ☆200 · Updated 4 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆558 · Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆291 · Updated 3 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers. ☆157 · Updated 2 years ago
- [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction ☆386 · Updated 3 years ago
- ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637) ☆209 · Updated last year
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆289 · Updated 3 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc… ☆309 · Updated 3 years ago
- Boundary IoU API (Beta version) ☆229 · Updated last year
- [Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021 ☆168 · Updated 3 years ago
- ☆249 · Updated 3 years ago
- Bottleneck Transformers for Visual Recognition ☆280 · Updated 4 years ago
- ☆140 · Updated 3 years ago
- ☆193 · Updated 2 years ago
- Self-supervised vIsion Transformer (SiT) ☆337 · Updated 2 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers ☆228 · Updated 4 years ago
- Code for the Convolutional Vision Transformer (ConViT) ☆470 · Updated 4 years ago
- ☆135 · Updated 2 years ago
- A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of “C… ☆242 · Updated 3 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality" ☆244 · Updated 2 years ago
- [TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers ☆488 · Updated 2 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch) ☆192 · Updated 3 years ago
- EsViT: Efficient self-supervised Vision Transformers ☆411 · Updated 2 years ago
- This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight) ☆456 · Updated 4 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers" ☆433 · Updated 2 years ago
- [ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer" ☆369 · Updated last year
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers. ☆227 · Updated 3 years ago
- Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021 ☆355 · Updated 4 years ago
- Official PyTorch implementation of Fully Attentional Networks ☆480 · Updated 2 years ago
- SOTR: Segmenting Objects with Transformers ☆193 · Updated 4 years ago