yuhuan-wu / P2T
[TPAMI22] Pyramid Pooling Transformer for Scene Understanding
☆207 Updated last year
Alternatives and similar repositories for P2T:
Users who are interested in P2T are comparing it to the repositories listed below
- ☆214 Updated 3 years ago
- ☆170 Updated 2 months ago
- Lite Vision Transformer (CVPR 2022) ☆139 Updated 2 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer ☆116 Updated last year
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆198 Updated last year
- ☆136 Updated 8 months ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆282 Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆139 Updated 2 years ago
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆157 Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention" ☆270 Updated last year
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆153 Updated 3 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers" ☆351 Updated 2 years ago
- Code and models for mobile-former ☆123 Updated 2 years ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer" ☆85 Updated last year
- iFormer: Inception Transformer ☆244 Updated 2 years ago
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition ☆172 Updated last week
- ☆193 Updated 2 years ago
- CMT PyTorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107… ☆97 Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆119 Updated 3 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions ☆328 Updated last year
- ☆128 Updated 2 years ago
- CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction (Pattern Recognition 2024) ☆121 Updated 3 months ago
- ☆143 Updated last year
- Official ImageNet Model repository ☆246 Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆189 Updated 7 months ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆126 Updated 2 years ago
- Official repository of ACmix (CVPR2022) ☆407 Updated 2 years ago
- PyTorch code for CVPR2021 paper "Learning Statistical Texture for Semantic Segmentation" ☆94 Updated 3 years ago
- Official repository of Slide-Transformer (CVPR2023) ☆167 Updated 6 months ago
- Official MegEngine implementation of RepLKNet ☆275 Updated 2 years ago