sunsmarterjie / iTPNLinks
(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆208Updated last year
Alternatives and similar repositories for iTPN
Users that are interested in iTPN are comparing it to the libraries listed below
Sorting:
- ☆72Updated 2 years ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆213Updated 2 years ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆264Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆59Updated last year
- ☆87Updated 2 years ago
- ☆149Updated last year
- Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"☆143Updated 9 months ago
- GroupMixAttention and GroupMixFormer☆116Updated last year
- CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection☆192Updated 2 years ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆119Updated last year
- ☆186Updated 11 months ago
- ☆124Updated 2 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆217Updated 5 months ago
- ☆216Updated 3 years ago
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)☆239Updated 2 years ago
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated 2 years ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆86Updated 5 months ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆331Updated 10 months ago
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆196Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers"☆101Updated last year
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆34Updated 9 months ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆90Updated 2 years ago
- [ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection☆139Updated 8 months ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆292Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆85Updated 7 months ago
- [CVPR 2024] Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective☆20Updated last year
- ☆85Updated 2 years ago
- Official repository of Slide-Transformer (CVPR2023)☆173Updated last year
- ☆43Updated last year