FarinaMatteo / rethinking_fewshot_vlms
[CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", accepted at (and to appear in) CVPR 2025.
☆9Updated this week
Alternatives and similar repositories for rethinking_fewshot_vlms:
Users that are interested in rethinking_fewshot_vlms are comparing it to the libraries listed below
- [ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"☆18Updated 5 months ago
- Code for CVPR2025 "MMRL: Multi-Modal Representation Learning for Vision-Language Models".☆20Updated 2 weeks ago
- a training-free approach to accelerate ViTs and VLMs by pruning redundant tokens based on similarity☆11Updated this week
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆16Updated last week
- [CVPR-25🔥] Test-time Counterattacks (TTC) towards adversarial robustness of CLIP☆20Updated 3 weeks ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆31Updated 2 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 3 months ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 3 weeks ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆23Updated last month
- [CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning".☆21Updated 3 weeks ago
- ☆20Updated 11 months ago
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning☆14Updated last week
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- ☆40Updated 4 months ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25Updated 10 months ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆23Updated last week
- [NeurIPS2024] An official pytorch implement of the paper: BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping☆14Updated 3 weeks ago
- ☆17Updated last month
- ☆38Updated last year
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated 4 months ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆19Updated 6 months ago
- Continual Forgetting for Pre-trained Vision Models (CVPR 2024)☆62Updated 2 months ago
- ☆23Updated 9 months ago
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆29Updated 6 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆53Updated 7 months ago
- This repository houses the code for the paper - "The Neglected of VLMs"☆28Updated 3 months ago
- Easy wrapper for inserting LoRA layers in CLIP.☆31Updated 9 months ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆14Updated 11 months ago
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Updated 2 years ago
- Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"☆20Updated this week