cheerss / CrossFormer
The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
☆382Updated last year
Alternatives and similar repositories for CrossFormer:
Users that are interested in CrossFormer are comparing it to the libraries listed below
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆250Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆471Updated last year
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆531Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆282Updated 2 years ago
- iFormer: Inception Transformer☆244Updated 2 years ago
- ☆214Updated 3 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆330Updated last year
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆376Updated 3 years ago
- Official MegEngine implementation of RepLKNet☆275Updated 2 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆598Updated 2 years ago
- ☆190Updated 2 years ago
- Bottleneck Transformers for Visual Recognition☆278Updated 4 years ago
- unofficial implementation of CondConv: Conditionally Parameterized Convolutions for Efficient Inference in PyTorch.☆157Updated last year
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆226Updated 2 years ago
- Official repository of ACmix (CVPR2022)☆409Updated 2 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆598Updated last year
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆563Updated last year
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)☆191Updated 3 years ago
- [ICCV 2021] Code for approximated exponential maximum pooling☆294Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆427Updated last year
- Code and models for mobile-former☆124Updated 2 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆116Updated last year
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆153Updated 3 years ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆348Updated last year
- Official implement of "CAT: Cross Attention in Vision Transformer".☆157Updated 2 years ago
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆196Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆498Updated 2 years ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆207Updated last year
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆554Updated 3 years ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆97Updated 2 years ago