sunsmarterjie / iTPN
(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆188 Updated 6 months ago
Alternatives and similar repositories for iTPN:
Users who are interested in iTPN are comparing it to the repositories listed below:
- ☆59 Updated last year
- CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection ☆149 Updated last year
- ☆85 Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆51 Updated 7 months ago
- Official Code of Paper "Reversible Column Networks" "RevColv2" ☆258 Updated last year
- GroupMixAttention and GroupMixFormer ☆115 Updated last year
- [CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers" ☆84 Updated 2 months ago
- ☆133 Updated 7 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆198 Updated last year
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense… ☆266 Updated last month
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding ☆205 Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆137 Updated 2 years ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design ☆88 Updated 8 months ago
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection ☆72 Updated 4 months ago
- ☆170 Updated last month
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024) ☆224 Updated last year
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model ☆79 Updated last year
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection ☆89 Updated last year
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision" ☆91 Updated 7 months ago
- PEM: Prototype-based Efficient MaskFormer for Image Segmentation ☆85 Updated 3 months ago
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design ☆195 Updated last year
- ☆117 Updated last year
- ☆104 Updated last year
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition ☆170 Updated last year
- Code release for "Active Teacher for Semi-Supervised Object Detection", CVPR2022 ☆83 Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆183 Updated last year
- Official implementation of the ICML 2023 paper "A Closer Look at Self-Supervised Lightweight Vision Transformers" ☆122 Updated last year
- ☆44 Updated 10 months ago
- Code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer" ☆26 Updated last year
- ☆213 Updated 3 years ago