GingL / CMPA
☆14Updated last year
Alternatives and similar repositories for CMPA:
Users that are interested in CMPA are comparing it to the libraries listed below
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆31Updated last year
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆74Updated 9 months ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆50Updated 6 months ago
- ☆38Updated 9 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆67Updated 3 weeks ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆31Updated 10 months ago
- Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023]☆12Updated last year
- [AAAI2024] Official implementation of the AAAI 2024 paper TGP-T☆28Updated 9 months ago
- ☆24Updated 10 months ago
- ☆89Updated last year
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆54Updated last year
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆90Updated 2 months ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆96Updated 10 months ago
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆44Updated 9 months ago
- AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval☆15Updated 5 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆31Updated 6 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆45Updated 2 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆68Updated 6 months ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆13Updated last year
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆43Updated 5 months ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆45Updated 8 months ago
- ☆88Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆68Updated last year
- Multimodal-Composite-Editing-and-Retrieval-update☆28Updated 3 months ago
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆29Updated last month
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆29Updated 9 months ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆36Updated 6 months ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆28Updated 10 months ago
- ☆34Updated 2 years ago
- Repo of NeurIPS23☆14Updated last year