Mxbonn / ltmpLinks
Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transformers to any desired size with minimal loss of accuracy.
☆17Updated last year
Alternatives and similar repositories for ltmp
Users that are interested in ltmp are comparing it to the libraries listed below
Sorting:
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆44Updated 2 years ago
- [CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"☆45Updated 9 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated 2 years ago
- ☆33Updated 3 years ago
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720☆56Updated last year
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆55Updated 3 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆53Updated 2 years ago
- Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)"☆43Updated 2 years ago
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆106Updated 2 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆61Updated 2 years ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 3 years ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆35Updated 2 years ago
- ☆30Updated 2 years ago
- ☆55Updated 4 months ago
- ☆61Updated 2 years ago
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆104Updated last year
- Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation☆107Updated 10 months ago
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions☆50Updated 2 years ago
- [ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.☆75Updated 2 years ago
- ☆84Updated 2 years ago
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Updated last year
- ☆43Updated 3 months ago
- Official Pytorch implementation of Dynamic-Token-Pruning (ICCV2023)☆21Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Updated last year
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆83Updated 3 years ago
- ☆26Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆108Updated 2 years ago
- A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.☆44Updated 3 years ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Updated 2 years ago
- Compress conventional Vision-Language Pre-training data☆53Updated 2 years ago