ZjjConan / VLM-MultiModalAdapterLinks
The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".
☆69Updated last month
Alternatives and similar repositories for VLM-MultiModalAdapter
Users that are interested in VLM-MultiModalAdapter are comparing it to the libraries listed below
Sorting:
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆36Updated 11 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆105Updated last year
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆73Updated last year
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆47Updated 10 months ago
- ☆24Updated last year
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆73Updated 11 months ago
- Easy wrapper for inserting LoRA layers in CLIP.☆32Updated 11 months ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆101Updated last year
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"☆95Updated 10 months ago
- ☆47Updated last year
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆107Updated 5 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆79Updated 10 months ago
- [NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models☆41Updated 2 months ago
- ☆41Updated 3 months ago
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆37Updated 2 months ago
- ☆41Updated 3 weeks ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 4 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆34Updated 5 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆73Updated 2 years ago
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆63Updated 4 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆30Updated 3 months ago
- cliptrase☆36Updated 9 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆165Updated last year
- The official implementation of CVPR 24' Paper "Learning Transferable Negative Prompts for Out-of-Distribution Detection"☆55Updated last year
- ☆17Updated 7 months ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆44Updated 7 months ago
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆68Updated last year
- ☆96Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆32Updated last year
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆265Updated last year