Delong-liu-bupt / Composed_Person_Retrieval
[NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databases by combining both a reference image and a textual description as the query.
☆63Updated last week
Alternatives and similar repositories for Composed_Person_Retrieval
Users who are interested in Composed_Person_Retrieval are comparing it to the repositories listed below.
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆85Updated last year
- Multimodal Referring Segmentation☆169Updated last month
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆238Updated 2 years ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆189Updated 2 years ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆359Updated 3 years ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆77Updated last month
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Updated 2 years ago
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆81Updated 2 years ago
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆58Updated last month
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆69Updated last year
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆358Updated last month
- ☆158Updated 2 years ago
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆79Updated last year
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆22Updated 4 months ago
- Abstract. Person search is a challenging problem with various real-world applications, that aims at joint person detection and re-identi…☆13Updated last year
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)☆104Updated 11 months ago
- 【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search☆67Updated 2 years ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆54Updated 2 months ago
- DDAM-PS: Diligent Domain Adaptive Mixer for Person Search -- WACV2024☆12Updated last year
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆692Updated 2 years ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆53Updated last year
- Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification (CVPR 2025 Pytorch Code)☆28Updated 3 months ago
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)☆34Updated 5 months ago
- [ICCV 2025] Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation☆51Updated 2 months ago
- [CVPR2023 Highlight] Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection☆312Updated 2 years ago
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆65Updated last year
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR) | Remote Sensing Cross-Model Retrieval (RSCM…☆62Updated 7 months ago
- ☆14Updated last year
- [ICCV 2023 & TPAMI 2025] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆520Updated 2 months ago
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆248Updated 7 months ago