[EMNLP25 Main] The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"
☆25 · Mar 30, 2026 · Updated 3 months ago
Alternatives and similar repositories for GA-DMS
Users that are interested in GA-DMS are comparing it to the libraries listed below.
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding] ☆16 · Jul 15, 2025 · Updated 11 months ago
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models ☆17 · Jan 10, 2025 · Updated last year
- The official repo for the DanQing dataset. ☆36 · Mar 25, 2026 · Updated 3 months ago
- 【CVPR 2025】Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment ☆39 · Sep 17, 2025 · Updated 9 months ago
- Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification (CVPR 2025 Pytorch Code) ☆48 · Jul 19, 2025 · Updated 11 months ago
- V-SWIFT: Training a Small VideoMAE Model on a Single Machine in a Day ☆30 · Feb 5, 2025 · Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024) ☆91 · Jul 13, 2024 · Updated last year
- Code for Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification (CVPR2025) ☆48 · Nov 4, 2025 · Updated 7 months ago
- ☆11 · Aug 4, 2024 · Updated last year
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024) ☆22 · Mar 25, 2024 · Updated 2 years ago
- Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models" ☆48 · Apr 3, 2025 · Updated last year
- ☆37 · Mar 28, 2025 · Updated last year
- [ACM MM2025] The official repository for the RealSyn dataset ☆39 · Dec 14, 2025 · Updated 6 months ago
- Extending a simple RAG with Langchain & RAGAS ☆25 · Dec 14, 2025 · Updated 6 months ago
- Margin-based Vision Transformer ☆69 · Apr 7, 2026 · Updated 2 months ago
- MLCD-Seg is a zero-shot segmentation model from DeepGlint. ☆18 · Jul 4, 2025 · Updated 11 months ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens ☆17 · Sep 8, 2025 · Updated 9 months ago
- Fully Open Framework for Democratized Multimodal Reinforcement Learning. ☆51 · Dec 19, 2025 · Updated 6 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em… ☆84 · May 24, 2026 · Updated last month
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding ☆56 · Apr 7, 2025 · Updated last year
- [ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs" ☆106 · Dec 8, 2025 · Updated 6 months ago
- ☆32 · Sep 24, 2023 · Updated 2 years ago
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption ☆105 · Sep 18, 2023 · Updated 2 years ago
- ☆13 · Oct 30, 2024 · Updated last year
- 【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search ☆78 · Jul 9, 2023 · Updated 2 years ago
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning" ☆74 · Dec 8, 2025 · Updated 6 months ago
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark" ☆174 · Jul 23, 2025 · Updated 11 months ago
- Train SSD with PyTorch; substantially modified compared with the original pytorch-ssd ☆11 · Jul 4, 2022 · Updated 3 years ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering ☆27 · Apr 24, 2025 · Updated last year
- a library of works related to Large Language Models (LLMs) based Agent Hallucination ☆59 · Oct 30, 2025 · Updated 8 months ago
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity ☆81 · Sep 28, 2024 · Updated last year
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner ☆151 · Dec 14, 2025 · Updated 6 months ago
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning ☆20 · Feb 26, 2025 · Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆70 · Mar 17, 2026 · Updated 3 months ago
- Learning to Annotate Part Segmentation with Gradient Matching (ICLR 2022) ☆12 · Apr 26, 2022 · Updated 4 years ago
- Pytorch implementation for Negation-Aware Test-Time Adaptation for Vision-Language Models. ☆36 · Mar 18, 2026 · Updated 3 months ago
- train ssd ☆10 · Apr 30, 2019 · Updated 7 years ago
- ☆11 · May 12, 2023 · Updated 3 years ago
- ☆11 · May 15, 2020 · Updated 6 years ago