abhinav-neil / clip-zs-prompting
Using CLIP for zero-shot learning and image classification with text & visual prompting.
☆15Updated 2 years ago
Alternatives and similar repositories for clip-zs-prompting
Users who are interested in clip-zs-prompting are comparing it to the repositories listed below
- code for studying OpenAI's CLIP explainability☆33Updated 3 years ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆88Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆170Updated last year
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆106Updated last year
- An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.☆130Updated 7 months ago
- Finetuning CLIP for Few Shot Learning☆45Updated 3 years ago
- A Survey on multimodal learning research.☆329Updated last year
- ☆18Updated 3 months ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆284Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆194Updated 2 years ago
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆147Updated last year
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆77Updated 2 months ago
- [ICLR 2023] Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning"☆59Updated 2 years ago
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆78Updated last year
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆121Updated 2 years ago
- Code to train CLIP model☆117Updated 3 years ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"☆107Updated 2 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆73Updated 2 years ago
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning.☆233Updated last year
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆19Updated 4 months ago
- The official PyTorch implementation of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆76Updated 3 months ago
- Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".☆130Updated 9 months ago
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆58Updated last year
- ☆544Updated 3 years ago
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation☆123Updated last year
- Visual self-questioning for large vision-language assistant.☆42Updated 3 weeks ago
- [NeurIPS 2023] Meta-Adapter☆49Updated last year
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆272Updated last year
- SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation☆117Updated last year
- Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.☆45Updated 3 years ago