sunsmarterjie / iTPN
(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆183Updated 5 months ago
Alternatives and similar repositories for iTPN:
Users that are interested in iTPN are comparing it to the libraries listed below
- ☆58Updated last year
- This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆197Updated last year
- ☆211Updated 3 years ago
- ☆83Updated last year
- ☆132Updated 6 months ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆204Updated last year
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆255Updated last year
- [CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers"☆84Updated last month
- GroupMixAttention and GroupMixFormer☆114Updated last year
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection☆144Updated last year
- ☆83Updated last year
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆89Updated 6 months ago
- vHeat: Building Vision Models upon Heat Conduction☆102Updated 7 months ago
- ☆168Updated 2 weeks ago
- [CVPR 2023] Token Contrast for Weakly-Supervised Semantic Segmentation☆157Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆50Updated 6 months ago
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆193Updated last year
- This is the repository for TNNLS paper: "Unihead: unifying multi-perception for detection heads"☆12Updated this week
- ☆104Updated last year
- Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"☆120Updated last year
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆95Updated 2 years ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆170Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated 9 months ago
- ☆245Updated 2 years ago
- [CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector☆234Updated 2 years ago
- ☆116Updated last year
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆188Updated last year
- ☆44Updated 8 months ago
- ☆169Updated 3 years ago