XLearning-SCU / LLaVA-ReIDLinks
Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification
☆33Updated this week
Alternatives and similar repositories for LLaVA-ReID
Users that are interested in LLaVA-ReID are comparing it to the libraries listed below
Sorting:
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)☆27Updated 2 months ago
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)☆92Updated 7 months ago
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆68Updated last year
- 【CVPR2025】IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification☆29Updated 3 months ago
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)☆17Updated last year
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity☆66Updated 9 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆71Updated last week
- 【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification☆55Updated 4 months ago
- ☆30Updated last year
- Robust Pseudo-label Learning with Neighbor Relation for Unsupervised Visible-Infrared Person Re-Identification☆14Updated 6 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆52Updated last year
- ☆20Updated 4 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 9 months ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆67Updated last year
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆53Updated last week
- Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databas…☆25Updated last month
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆83Updated 6 months ago
- ☆9Updated 9 months ago
- Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification (CVPR 2025 Pytorch Code)☆23Updated last month
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆76Updated last year
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆14Updated last year
- Towards Modality-Agnostic Person Re-identification with Descriptive Query CVPR2023☆25Updated 11 months ago
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆75Updated 2 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆51Updated 3 months ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆106Updated last month
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 5 months ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆83Updated last year
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification☆107Updated 8 months ago
- ☆24Updated last year
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆14Updated 7 months ago