Shuyu-XJTU / CMPLinks
The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"
☆24Updated 3 months ago
Alternatives and similar repositories for CMP
Users that are interested in CMP are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆90Updated last month
- ☆49Updated 10 months ago
- Multimodal-Composite-Editing-and-Retrieval-update☆34Updated 2 months ago
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆43Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆52Updated last year
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆54Updated 4 months ago
- [CVPR 2025] Official PyTorch Code for "MMRL: Multi-Modal Representation Learning for Vision-Language Models" and its extension "MMRL++: P…☆87Updated 5 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆125Updated 3 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆55Updated 6 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 10 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆108Updated 3 weeks ago
- Unofficial Implementation to CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification [ICCV'23]☆34Updated last year
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆58Updated 3 weeks ago
- Official PyTorch Code for Anchor Token Guided Prompt Learning Methods: [ICCV 2025] ATPrompt and [Arxiv 2511.21188] AnchorOPT☆117Updated this week
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆91Updated last year
- The official implementation of paper: "Inter-Instance Similarity Modeling for Contrastive Learning"☆117Updated last year
- [AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training☆105Updated last year
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆75Updated 4 months ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆28Updated last year
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)☆29Updated last year
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆87Updated last year
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆62Updated 5 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆58Updated 2 years ago
- ☆57Updated 5 months ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆72Updated last year
- The efficient tuning method for VLMs☆80Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆73Updated 2 years ago
- Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)☆55Updated 8 months ago
- ☆52Updated 2 years ago
- ☆32Updated 2 years ago