zhangxiaosong18 / hivit
☆58Updated last year
Alternatives and similar repositories for hivit:
Users that are interested in hivit are comparing it to the libraries listed below
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆183Updated 6 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆50Updated 6 months ago
- ☆84Updated last year
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆81Updated 2 weeks ago
- Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"☆122Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆59Updated this week
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆72Updated 5 months ago
- ☆44Updated 9 months ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆60Updated last month
- [ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families☆49Updated last year
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆71Updated 3 months ago
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection☆88Updated last year
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆68Updated last year
- Official Code for 'Referring Camouflaged Object Detection (指向性伪装物体检测) ' (TPAMI 2025)☆79Updated 2 weeks ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆77Updated 5 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆181Updated 11 months ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆72Updated 2 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆177Updated last year
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆51Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆103Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆137Updated 2 years ago
- [CVPR2023] This is an official mmdet implementation of paper "DETRs with Hybrid Matching".☆48Updated 2 years ago
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆31Updated 7 months ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆52Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆74Updated 5 months ago
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated 2 years ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated 9 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆68Updated 7 months ago
- ☆32Updated last year