Multimodal-Representation-Learning-MRL/GA-DMS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Multimodal-Representation-Learning-MRL/GA-DMS)

Multimodal-Representation-Learning-MRL / GA-DMS

[EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"

☆25

Alternatives and similar repositories for GA-DMS

Users that are interested in GA-DMS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Flame-Chasers / Bi-IRRA
View on GitHub
【TPAMI 2025】Bi-IRRA: Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Alignment
☆16Oct 3, 2025Updated 9 months ago
QinYang79 / ICL
View on GitHub
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification (CVPR 2025 Pytorch Code)
☆49Jul 19, 2025Updated last year
MPI-Lab / HAM
View on GitHub
Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025)
☆50Nov 4, 2025Updated 8 months ago
CFM-MSG / Code-AUL
View on GitHub
☆19Mar 5, 2024Updated 2 years ago
deepglint / RealSyn
View on GitHub
[ACM MM2025] The official repository for the RealSyn dataset
☆39Dec 14, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
QinYang79 / RDE
View on GitHub
Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)
☆130Nov 28, 2024Updated last year
Liu-Yating / DM-Adapter
View on GitHub
About [AAAI 2025] Official repository of paper titled "DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval"
☆16Feb 9, 2025Updated last year
zqxie77 / CONQUER
View on GitHub
☆44Jun 8, 2026Updated last month
xiaoxing2001 / DeGLA
View on GitHub
[ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]
☆16Jul 15, 2025Updated last year
XLearning-SCU / 2024-TIP-CREAM
View on GitHub
PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)
☆22Mar 25, 2024Updated 2 years ago
MPI-Lab / MLLM4Text-ReID
View on GitHub
Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)
☆91Jul 13, 2024Updated 2 years ago
AsuradaYuci / CLIMB-ReID
View on GitHub
CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification（AAAI2025）
☆54Nov 24, 2025Updated 7 months ago
Flame-Chasers / RaSa
View on GitHub
【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search
☆78Jul 9, 2023Updated 3 years ago
LinDixuan / CADA
View on GitHub
☆20Jul 9, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
MediaBrain-SJTU / GSC
View on GitHub
☆14Jul 13, 2024Updated 2 years ago
shuanglinyan / CFine
View on GitHub
CLIP-Driven Fine-grained Text-Image Person Re-identification
☆67Nov 22, 2023Updated 2 years ago
Zplusdragon / UFineBench
View on GitHub
[CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity
☆81Sep 28, 2024Updated last year
ZhiyinShao-H / UniPT
View on GitHub
☆32Sep 24, 2023Updated 2 years ago
hhc1997 / L2RM
View on GitHub
☆43Mar 28, 2024Updated 2 years ago
Zplusdragon / ReID5o_ORBench
View on GitHub
[NeurIPS2025] ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
☆93Jan 8, 2026Updated 6 months ago
qijimrc / mm_evaluation
View on GitHub
☆11Aug 4, 2024Updated last year
lezhang7 / Enhance-FineGrained
View on GitHub
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
☆56Apr 7, 2025Updated last year
Yifei-AHU / AERI-PEDES
View on GitHub
Code and Dataset of paper "Cross-modal Fuzzy Alignment Network for Text-Aerial Person Retrieval and A Large-scale Benchmark" (CVPR 2026)
☆20Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
GaryGuTC / UniME-v2
View on GitHub
[AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"
☆74Dec 8, 2025Updated 7 months ago
deepglint / UniME
View on GitHub
[ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆105Dec 8, 2025Updated 7 months ago
QinYang79 / Awesome-Noisy-Correspondence
View on GitHub
This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…
☆86May 24, 2026Updated last month
XLearning-SCU / Awesome-Noisy-Correspondence
View on GitHub
This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…
☆138May 23, 2026Updated last month
lezhang7 / SAIL
View on GitHub
[CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"
☆60Aug 15, 2025Updated 11 months ago
Shuyu-XJTU / APTM
View on GitHub
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
☆174Jul 6, 2026Updated 2 weeks ago
deepglint / MLCD-Seg
View on GitHub
MLCD-Seg is a zero-shot segmentation model from DeepGlint.
☆18Jul 4, 2025Updated last year
Oneflow-Inc / oneflow_face
View on GitHub
☆12Aug 10, 2022Updated 3 years ago
siyuancncd / FUME
View on GitHub
This is the official implementation of "Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval" (CVPR 2025)
☆40Jul 6, 2026Updated 2 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
WHU-HZY / TVI-LFM
View on GitHub
The implementation of paper: Empowering Visible-Infrared Person Re-Identification with Large Foundation Models, NeurIPS 2024
☆32Dec 30, 2024Updated last year
ccq195 / UNIReID
View on GitHub
Towards Modality-Agnostic Person Re-identification with Descriptive Query CVPR2023
☆31Aug 4, 2024Updated last year
ruc-aimc-lab / TeachCLIP
View on GitHub
[CVPR 2024] TeachCLIP for Text-to-Video Retrieval
☆42May 7, 2025Updated last year
cqu-student / Wiki-PRF
View on GitHub
☆19Mar 9, 2026Updated 4 months ago
deepglint / ALIP
View on GitHub
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
☆106Sep 18, 2023Updated 2 years ago
ShareLab-SII / CoMP-MM
View on GitHub
Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"
☆48Apr 3, 2025Updated last year
LunarShen / DsicoVLA
View on GitHub
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22Jun 23, 2025Updated last year