[CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity
☆79Sep 28, 2024Updated last year
Alternatives and similar repositories for UFineBench
Users that are interested in UFineBench are comparing it to the libraries listed below
Sorting:
- [NeurIPS2024] PLIP: Language-Image Pre-training for Person Representation Learning☆132Dec 17, 2024Updated last year
- ☆17Mar 5, 2024Updated last year
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"☆168Jul 23, 2025Updated 7 months ago
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)☆113Nov 28, 2024Updated last year
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)☆38Nov 4, 2025Updated 3 months ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆73Apr 15, 2024Updated last year
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆268Mar 26, 2025Updated 11 months ago
- ☆37Mar 28, 2025Updated 11 months ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆96Nov 20, 2025Updated 3 months ago
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆27Jul 29, 2024Updated last year
- ☆18Jul 9, 2024Updated last year
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆20Feb 26, 2025Updated last year
- ☆54Apr 13, 2023Updated 2 years ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Jul 9, 2024Updated last year
- [NeurIPS2025] ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model☆83Jan 8, 2026Updated last month
- ☆26Aug 9, 2024Updated last year
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso…☆73Oct 20, 2025Updated 4 months ago
- Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI …☆456Nov 21, 2023Updated 2 years ago
- Code of SSAN☆69Mar 7, 2024Updated last year
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆30Jul 2, 2025Updated 7 months ago
- MSRSegNet: Multi-Scale Residual Network for Semantic Segmentation☆10Aug 9, 2018Updated 7 years ago
- 【CVPR 2025】Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment☆32Sep 17, 2025Updated 5 months ago
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆17Feb 4, 2026Updated 3 weeks ago
- [Neural Networks 2025]Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval☆11Dec 24, 2024Updated last year
- Images of the PerxonX dataset and the original 3D human models of this dataset☆118Aug 17, 2022Updated 3 years ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆61Nov 22, 2023Updated 2 years ago
- ☆17Apr 18, 2025Updated 10 months ago
- Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea…☆13Jul 22, 2024Updated last year
- Code of AAAI2025 Paper 《VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things》☆15Jan 16, 2025Updated last year
- [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues☆17Dec 31, 2024Updated last year
- [ECCV-2024] DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition☆34May 3, 2025Updated 9 months ago
- [OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch☆180Feb 8, 2026Updated 3 weeks ago
- ☆36Mar 28, 2024Updated last year
- Hierarchical Shot Detector (ICCV2019)☆57Dec 6, 2020Updated 5 years ago
- ☆22Apr 3, 2024Updated last year
- ☆15Dec 3, 2021Updated 4 years ago
- ☆17Mar 30, 2024Updated last year
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- [BMVC 2021] Text-Based Person Search with Limited Data☆47Aug 12, 2022Updated 3 years ago