zjukg / DUET
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
☆53Updated last year
Alternatives and similar repositories for DUET
Users who are interested in DUET are comparing it to the repositories listed below.
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022)☆70Updated 2 years ago
- ☆79Updated 2 years ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆50Updated 6 months ago
- Code release for Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning | IROS 2024☆50Updated last year
- [NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models☆39Updated last year
- ☆26Updated 2 years ago
- Official PyTorch Implementation of TransZero (AAAI'22)☆83Updated 2 years ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆34Updated last year
- Official PyTorch Implementation of MSDN (CVPR'22)☆54Updated 3 years ago
- ☆13Updated last year
- Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval (ACM Multimedia 2022, PyTorch Code)☆47Updated last year
- Code implementation for the paper: Supervised Masked Knowledge Distillation for Few-Shot Transformers☆43Updated 2 years ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆70Updated 3 years ago
- This is a PyTorch implementation of the paper "Attribute Prototype Network for Zero-Shot Learning".☆75Updated 3 years ago
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Updated 2 years ago
- Multi-label Image Recognition with Partial Labels (IJCV'24, ESWA'24, AAAI'22)☆43Updated last year
- The summary of code and paper for few-shot learning in fine-grained recognition☆80Updated 6 months ago
- Unofficial Implementation to CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification [ICCV'23]☆34Updated last year
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 11 months ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29Updated 3 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆33Updated 7 months ago
- [ICLR 2023] Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning"☆60Updated 2 years ago
- PyTorch implementation of DAPrompt: https://arxiv.org/abs/2202.06687 ☆96Updated 2 years ago
- [CVPR 2024] Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning☆29Updated 10 months ago
- ☆42Updated 2 years ago
- Cross-Modal-Real-valuded-Retrieval☆86Updated 2 years ago
- ☆95Updated 2 years ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆44Updated last year
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆226Updated 2 years ago
- Official PyTorch Implementation of ZSLViT (CVPR'24)☆16Updated last year