yuhuan-wu / P2T
[TPAMI22] Pyramid Pooling Transformer for Scene Understanding
☆199Updated last year
Related projects ⓘ
Alternatives and complementary repositories for P2T
- ☆210Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆280Updated 2 years ago
- ☆149Updated last year
- Lite Vision Transformer (CVPR 2022)☆134Updated 2 years ago
- This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆192Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆175Updated 3 months ago
- ☆128Updated 4 months ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆141Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆137Updated 2 years ago
- CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction (Pattern Recognition 2024)☆110Updated 3 months ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆321Updated 10 months ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆93Updated 2 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆114Updated last year
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆148Updated 3 years ago
- ☆123Updated 7 months ago
- iFormer: Inception Transformer☆242Updated last year
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆249Updated last year
- CMT: Convolutional Neural Networks Meet Vision Transformers☆119Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆254Updated 11 months ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆157Updated 11 months ago
- ☆120Updated last year
- ☆191Updated last year
- Code and models for mobile-former☆119Updated 2 years ago
- ☆79Updated last year
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆85Updated last year
- Masked Generative Distillation (ECCV 2022)☆212Updated 2 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆349Updated last year
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks☆214Updated 2 years ago
- Official repository of ACmix (CVPR2022)☆400Updated 2 years ago
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)☆249Updated 11 months ago