Delong-liu-bupt / Composed_Person_RetrievalLinks
[NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databases by combining both a reference image and a textual description as the query.
☆71Updated 2 months ago
Alternatives and similar repositories for Composed_Person_Retrieval
Users that are interested in Composed_Person_Retrieval are comparing it to the libraries listed below
Sorting:
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Updated last year
- Multimodal Referring Segmentation☆197Updated last month
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆242Updated last month
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆190Updated 2 years ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆360Updated 4 years ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Updated 2 years ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆73Updated last year
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆85Updated 3 months ago
- 【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search☆70Updated 2 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆362Updated 3 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆60Updated 2 years ago
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆28Updated 6 months ago
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)☆36Updated 2 months ago
- ☆157Updated 2 years ago
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)☆110Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆84Updated last year
- Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"☆241Updated 3 weeks ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆58Updated last month
- [ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval☆20Updated last year
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆82Updated 3 months ago
- DDAM-PS: Diligent Domain Adaptive Mixer for Person Search -- WACV2024☆12Updated last year
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆82Updated 2 years ago
- ☆95Updated 2 years ago
- Abstract. Person search is a challenging problem with various real- world applications, that aims at joint person detection and re-identi…☆13Updated last year
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆20Updated 10 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆76Updated 2 months ago
- [AAAI 2024] Heterogeneous Test-time Training for Multi-modal Person Re-identification☆16Updated 6 months ago
- Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval☆27Updated 9 months ago
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆690Updated last month
- [CVPR2023 Highlight] Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection☆314Updated 2 years ago