sMamooler / CLIP_Explainability
code for studying OpenAI's CLIP explainability
☆25Updated 2 years ago
Related projects:
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆62Updated 7 months ago
- ☆85Updated 11 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆93Updated 7 months ago
- Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning" (published at ICLR 202…☆48Updated last year
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆60Updated 5 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆91Updated last year
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022)☆51Updated 10 months ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets☆53Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆65Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆75Updated 5 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆42Updated last year
- [CVPR' 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆35Updated last month
- ☆81Updated 9 months ago
- Awesome List of Vision Language Prompt Papers☆34Updated 10 months ago
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆104Updated 2 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆37Updated 11 months ago
- ☆32Updated last year
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆31Updated last month
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆48Updated 3 months ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"☆63Updated last month
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆137Updated 9 months ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆31Updated last month
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆27Updated 5 months ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆44Updated 11 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆57Updated 3 months ago
- Repo for the paper "Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models" (ICML2024)☆18Updated 2 weeks ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆80Updated 9 months ago
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models☆75Updated 6 months ago
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…☆17Updated 3 months ago