sail-sg / iFormer
iFormer: Inception Transformer
☆244Updated 2 years ago
Alternatives and similar repositories for iFormer:
Users that are interested in iFormer are comparing it to the libraries listed below
- ☆214Updated 3 years ago
- ☆170Updated 2 months ago
- Official repository of Slide-Transformer (CVPR2023)☆167Updated 6 months ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆270Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆471Updated last year
- Lite Vision Transformer (CVPR 2022)☆139Updated 2 years ago
- ☆141Updated 6 months ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆373Updated 3 years ago
- Official MegEngine implementation of RepLKNet☆275Updated 2 years ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆207Updated last year
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆330Updated last year
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆282Updated 2 years ago
- Official repository of ACmix (CVPR2022)☆408Updated 2 years ago
- ☆83Updated last year
- Official ImageNet Model repository☆246Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024)☆454Updated 9 months ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆157Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆162Updated last year
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆347Updated last year
- ☆60Updated 3 years ago
- ☆197Updated 7 months ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆250Updated last year
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)☆283Updated 3 months ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆153Updated 3 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆497Updated 2 years ago
- GroupMixAttention and GroupMixFormer☆115Updated last year
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆196Updated 3 years ago
- ☆191Updated 2 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆184Updated 2 years ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆199Updated last year