sail-sg / iFormerLinks
iFormer: Inception Transformer
☆247Updated 2 years ago
Alternatives and similar repositories for iFormer
Users that are interested in iFormer are comparing it to the libraries listed below
Sorting:
- Official repository of Slide-Transformer (CVPR2023)☆172Updated 11 months ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆485Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆285Updated last year
- Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLP, S2MLPv2, RaftMLP, HireMLP, ConvMLP, AS-MLP, SparseMLP, Co…☆170Updated 3 years ago
- ☆215Updated 3 years ago
- ☆149Updated 11 months ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆340Updated last year
- Official implement of "CAT: Cross Attention in Vision Transformer".☆162Updated 3 years ago
- ☆184Updated 7 months ago
- Lite Vision Transformer (CVPR 2022)☆144Updated 2 years ago
- Official ImageNet Model repository☆252Updated 2 years ago
- Sequencer: Deep LSTM for Image Classification☆143Updated 2 years ago
- ☆85Updated last year
- GroupMixAttention and GroupMixFormer☆117Updated last year
- ☆145Updated last year
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆400Updated 3 years ago
- ☆63Updated 3 years ago
- ☆199Updated last year
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆163Updated 2 years ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆362Updated last year
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆204Updated 4 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆508Updated 2 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆395Updated last year
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆210Updated 2 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆213Updated 2 months ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆251Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆287Updated 2 years ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆98Updated 3 years ago
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)☆317Updated 8 months ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆480Updated last year