sail-sg / iFormerLinks
iFormer: Inception Transformer
☆246Updated 2 years ago
Alternatives and similar repositories for iFormer
Users that are interested in iFormer are comparing it to the libraries listed below
Sorting:
- Official implement of "CAT: Cross Attention in Vision Transformer".☆163Updated 3 years ago
- Official repository of Slide-Transformer (CVPR2023)☆172Updated last year
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆286Updated last year
- Lite Vision Transformer (CVPR 2022)☆144Updated 2 years ago
- ☆216Updated 3 years ago
- ☆151Updated last year
- Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLP, S2MLPv2, RaftMLP, HireMLP, ConvMLP, AS-MLP, SparseMLP, Co…☆169Updated 3 years ago
- Official ImageNet Model repository☆253Updated 2 years ago
- ☆184Updated 8 months ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆400Updated 3 years ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆363Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆486Updated 2 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆343Updated last year
- ☆63Updated 4 years ago
- Sequencer: Deep LSTM for Image Classification☆143Updated 2 years ago
- ☆85Updated 2 years ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆99Updated 3 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆163Updated 2 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆214Updated 2 months ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆203Updated 4 years ago
- ☆146Updated last year
- ☆199Updated last year
- GroupMixAttention and GroupMixFormer☆117Updated last year
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆251Updated 2 years ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆211Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆140Updated 3 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆489Updated last year
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆105Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers☆121Updated 3 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆394Updated last year