benbergner / croprLinks
A token pruning method that accelerates ViTs for various tasks while maintaining high performance.
☆14Updated 5 months ago
Alternatives and similar repositories for cropr
Users that are interested in cropr are comparing it to the libraries listed below
Sorting:
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- ☆10Updated 3 months ago
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆45Updated 2 weeks ago
- ☆42Updated 7 months ago
- ☆15Updated 7 months ago
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆41Updated 9 months ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆39Updated 11 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Official pytorch implementation of ZiRa, a method for incremental vision language object detection (IVLOD),which has been accepted by Neu…☆23Updated 8 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆40Updated last year
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated 2 years ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆32Updated last year
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆17Updated 11 months ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆23Updated 8 months ago
- ☆35Updated last year
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆29Updated last year
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters☆20Updated 11 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆54Updated 7 months ago
- ☆23Updated 2 years ago
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆32Updated 8 months ago
- Lightweight Transformer for Multi-modal Tasks☆16Updated 2 years ago
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆10Updated 6 months ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 2 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆22Updated 9 months ago
- ☆23Updated last year
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆36Updated 3 months ago
- Pytorch implementation of Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation (CVPR'24)☆26Updated last year
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆26Updated 3 months ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25Updated last year
- Generating Image Specific Text☆28Updated last year