USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33Jun 18, 2025Updated last year
Alternatives and similar repositories for USER
Users that are interested in USER are comparing it to the libraries listed below.
- ☆12May 3, 2024Updated 2 years ago
- ☆28Sep 3, 2024Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆70Apr 5, 2026Updated 2 months ago
- ☆53Sep 13, 2023Updated 2 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆165Aug 24, 2025Updated 10 months ago
- Nearest Neighbor Normalization (EMNLP 2024)☆21Nov 1, 2024Updated last year
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated 2 years ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 3 years ago
- ☆22Apr 10, 2024Updated 2 years ago
- Code for the paper "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…☆19Jan 16, 2024Updated 2 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- Implementation of our paper, "Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination".☆20Dec 3, 2023Updated 2 years ago
- Cross-Modal-Real-valuded-Retrieval☆88Jul 18, 2023Updated 2 years ago
- Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)☆22Apr 16, 2026Updated 2 months ago
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆119Jun 19, 2023Updated 3 years ago
- ☆28May 16, 2023Updated 3 years ago
- [TCSVT2023] - ESA: External Space Attention Aggregation for Image-Text Retrieval☆23Aug 30, 2024Updated last year
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 7 months ago
- Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023☆93Apr 21, 2025Updated last year
- [NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries☆12Apr 15, 2022Updated 4 years ago
- ☆14Jul 13, 2024Updated last year
- ☆15Apr 30, 2022Updated 4 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 4 years ago
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Feb 7, 2024Updated 2 years ago
- ☆14Dec 31, 2024Updated last year
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆41Nov 15, 2023Updated 2 years ago
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆68Mar 10, 2025Updated last year
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)☆579May 18, 2023Updated 3 years ago
- ☆13Jun 2, 2023Updated 3 years ago
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆30Dec 19, 2025Updated 6 months ago
- ☆17Nov 17, 2024Updated last year
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)☆22Mar 25, 2024Updated 2 years ago
- [ICLR 2023] This is the code repo for our ICLR'23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆52Jul 3, 2024Updated 2 years ago
- The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…☆445Sep 25, 2025Updated 9 months ago
- ☆82Nov 6, 2023Updated 2 years ago
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆28Sep 15, 2025Updated 9 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆65Nov 22, 2023Updated 2 years ago
- [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”☆219Apr 11, 2024Updated 2 years ago