yuhuan-wu / P2T
[TPAMI22] Pyramid Pooling Transformer for Scene Understanding
☆209 Updated last year
Alternatives and similar repositories for P2T:
Users who are interested in P2T are comparing it to the repositories listed below:
- ☆215 Updated 3 years ago
- Lite Vision Transformer (CVPR 2022) ☆142 Updated 2 years ago
- ☆176 Updated 4 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆200 Updated last year
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆282 Updated 2 years ago
- ☆138 Updated 10 months ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆139 Updated 2 years ago
- iFormer: Inception Transformer ☆246 Updated 2 years ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107… ☆97 Updated 2 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions ☆333 Updated last year
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆159 Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention" ☆277 Updated last year
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆156 Updated 3 years ago
- ☆146 Updated last year
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆119 Updated 3 years ago
- Code and models for mobile-former ☆123 Updated 2 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers" ☆351 Updated 2 years ago
- CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction (Pattern Recognition 2024) ☆122 Updated 4 months ago
- Official ImageNet Model repository ☆250 Updated 2 years ago
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition ☆190 Updated 2 weeks ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer" ☆103 Updated last year
- [ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models" ☆240 Updated last year
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer ☆116 Updated last year
- ☆144 Updated last year
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer" ☆86 Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆191 Updated 9 months ago
- Official repository of ACmix (CVPR2022) ☆412 Updated 3 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆126 Updated 2 years ago
- ☆142 Updated 8 months ago
- Official repository of Slide-Transformer (CVPR2023) ☆169 Updated 8 months ago