The official PyTorch implementation of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".
☆95Apr 24, 2025Updated 10 months ago
Alternatives and similar repositories for VLM-MultiModalAdapter
Users who are interested in VLM-MultiModalAdapter compare it to the repositories listed below
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆84May 24, 2024Updated last year
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"☆115Jul 15, 2024Updated last year
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆755Dec 1, 2025Updated 3 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆91Jul 4, 2024Updated last year
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"☆109Nov 24, 2025Updated 3 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆32Mar 10, 2025Updated 11 months ago
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆81Jun 7, 2025Updated 8 months ago
- [NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models☆46Mar 14, 2025Updated 11 months ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆285Sep 28, 2023Updated 2 years ago
- ☆105Dec 7, 2023Updated 2 years ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆348Dec 14, 2025Updated 2 months ago
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆64Feb 20, 2026Updated last week
- ☆24Jun 12, 2024Updated last year
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆807Jul 24, 2023Updated 2 years ago
- [NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP☆56Sep 26, 2024Updated last year
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use…☆50Jul 18, 2023Updated 2 years ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆110Feb 11, 2024Updated 2 years ago
- An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPR…☆285Jun 6, 2025Updated 8 months ago
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!☆61Mar 24, 2025Updated 11 months ago
- ☆21Dec 15, 2025Updated 2 months ago
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆49Mar 13, 2025Updated 11 months ago
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆28Jan 26, 2025Updated last year
- This repository lists some awesome public projects about Zero-shot/Few-shot Learning based on CLIP (Contrastive Language-Image Pre-Traini…☆27Nov 28, 2024Updated last year
- [CVPR 2024 Highlight] ImageNet-D☆47Oct 15, 2024Updated last year
- [CVPR 2025 & IJCV2026] Official PyTorch Code for "MMRL: Multi-Modal Representation Learning for Vision-Language Models" and its extension…☆97Feb 5, 2026Updated 3 weeks ago
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆41Jul 1, 2024Updated last year
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated 11 months ago
- Official PyTorch implementation of ZiRa, a method for incremental vision language object detection (IVLOD), which has been accepted by Neu…☆28Oct 22, 2024Updated last year
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆55Aug 16, 2024Updated last year
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"☆52Jun 16, 2025Updated 8 months ago
- Noise Contrastive Test-Time Training☆12Mar 11, 2024Updated last year
- Official code for "IT³: Idempotent Test-Time Training" (ICML 2025)☆14Jun 25, 2025Updated 8 months ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆50Jul 10, 2025Updated 7 months ago
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆121Dec 17, 2024Updated last year
- (NeurIPS 2024) Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models☆15Jul 18, 2025Updated 7 months ago
- ☆13Jul 17, 2024Updated last year
- The official implementation of SGPT (CVPR2024)☆17Jul 15, 2024Updated last year
- Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.☆18Nov 4, 2024Updated last year
- One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models☆58Dec 20, 2024Updated last year