zhangxiaosong18 / hivit
☆59 · Updated last year
Alternatives and similar repositories for hivit:
Users who are interested in hivit are comparing it to the libraries listed below:
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆185 · Updated 6 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆51 · Updated 7 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts" ☆62 · Updated last month
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection ☆88 · Updated last year
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP) ☆83 · Updated last month
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection ☆72 · Updated 4 months ago
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View". ☆32 · Updated 7 months ago
- (AAAI 2023 Oral) PyTorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer" ☆103 · Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆72 · Updated 6 months ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆137 · Updated 2 years ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision" ☆91 · Updated 7 months ago
- Official implementation for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers" ☆122 · Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation ☆59 · Updated 3 weeks ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory" ☆66 · Updated 4 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction ☆46 · Updated last year
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation) ☆97 · Updated 9 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition". ☆51 · Updated 2 years ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model ☆89 · Updated 8 months ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding ☆57 · Updated 2 years ago
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection" ☆88 · Updated last year
- Official Code for 'Referring Camouflaged Object Detection (指向性伪装物体检测)' (TPAMI 2025) ☆81 · Updated last month
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design ☆194 · Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆77 · Updated 5 months ago
- FastMIM, official PyTorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o… ☆39 · Updated 2 years ago
- Official implementation of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation ☆47 · Updated 6 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆182 · Updated last year
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers" ☆103 · Updated last year