yuhuan-wu / P2T
[TPAMI22] Pyramid Pooling Transformer for Scene Understanding
☆209 Updated 2 years ago
Alternatives and similar repositories for P2T
Users who are interested in P2T are comparing it to the repositories listed below.
- ☆181 Updated 4 months ago
- ☆216 Updated 3 years ago
- Lite Vision Transformer (CVPR 2022) ☆142 Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆285 Updated 2 years ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆207 Updated last year
- ☆142 Updated 11 months ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆139 Updated 2 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions ☆336 Updated last year
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer ☆116 Updated last year
- Official implement of "CAT: Cross Attention in Vision Transformer". ☆160 Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆156 Updated 3 years ago
- The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks ☆219 Updated last month
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107… ☆97 Updated 2 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers" ☆353 Updated 2 years ago
- iFormer: Inception Transformer ☆247 Updated 2 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆120 Updated 3 years ago
- Pytorch Re-Implementation | Dynamic Region-Aware Convolution (ECCV2020) ☆104 Updated 4 years ago
- CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction (Pattern Recognition 2024) ☆122 Updated 5 months ago
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition ☆199 Updated last month
- Official MegEngine implementation of RepLKNet ☆275 Updated 3 years ago
- Code and models for mobile-former ☆124 Updated 2 years ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆192 Updated 10 months ago
- ☆150 Updated last year
- Official repository of ACmix (CVPR2022) ☆415 Updated 3 years ago
- ☆130 Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆126 Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022]. ☆162 Updated last year
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer" ☆87 Updated 2 years ago
- ☆194 Updated 2 years ago
- ☆84 Updated last year