zhangy0822/USER

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhangy0822/USER)

zhangy0822 / USER

USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024

☆33

Alternatives and similar repositories for USER

Users that are interested in USER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CrossmodalGroup / ESL
View on GitHub
☆12May 3, 2024Updated 2 years ago
vkhoi / cora_cvpr24
View on GitHub
☆28Sep 3, 2024Updated last year
iLearn-Lab / SIGIR21-DIME
View on GitHub
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
☆69Apr 5, 2026Updated 3 months ago
lerogo / aaai24_itr_cusa
View on GitHub
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆55Mar 28, 2024Updated 2 years ago
QinYang79 / CRCL
View on GitHub
Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)
☆15Jun 6, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
clairecyq / whos-waldo
View on GitHub
Who's Waldo? Linking People Across Text and Images. ICCV 2021.
☆14May 17, 2023Updated 3 years ago
DarrenZZhang / MM23-MITH
View on GitHub
☆21Apr 10, 2024Updated 2 years ago
multimodal-interpretability / nnn
View on GitHub
Nearest Neighbor Normalization (EMNLP 2024)
☆21Nov 1, 2024Updated last year
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
LuminosityX / FNE
View on GitHub
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..
☆20Dec 3, 2023Updated 2 years ago
VL-Group / 2022-NeurIPS-DAA
View on GitHub
The code of the paper of "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…
☆19Jan 16, 2024Updated 2 years ago
BMC-SDNU / Cross-Modal-Retrieval
View on GitHub
Cross-Modal-Real-valuded-Retrieval
☆88Jul 18, 2023Updated 3 years ago
qxzha / UGNCL
View on GitHub
Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)
☆22Apr 16, 2026Updated 3 months ago
CrossmodalGroup / NAAF
View on GitHub
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Jun 19, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
PKU-ICST-MIPL / MKVSE-TOMM2023
View on GitHub
☆28May 16, 2023Updated 3 years ago
KevinLight831 / ESA
View on GitHub
[TCSVT2023] - ESA: External Space Attention Aggregation for Image-Text Retrieval
☆23Aug 30, 2024Updated last year
CrossmodalGroup / HREM
View on GitHub
Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023
☆93Apr 21, 2025Updated last year
alipay / PC2-NoiseofWeb
View on GitHub
Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …
☆16Nov 20, 2025Updated 8 months ago
uvavision / DrillDown
View on GitHub
[NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries
☆12Apr 15, 2022Updated 4 years ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 4 years ago
LgQu / CAMERA
View on GitHub
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
☆29May 26, 2022Updated 4 years ago
MediaBrain-SJTU / GSC
View on GitHub
☆14Jul 13, 2024Updated 2 years ago
jaychempan / PriorCLIP
View on GitHub
Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”
☆30Dec 19, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Cuberick-Orion / Bi-Blip4CIR
View on GitHub
The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…
☆34Feb 7, 2024Updated 2 years ago
hhc1997 / MSCN
View on GitHub
☆12Mar 28, 2024Updated 2 years ago
kuanghuei / SCAN
View on GitHub
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
☆579May 18, 2023Updated 3 years ago
weimingboya / DFT
View on GitHub
☆13Jun 2, 2023Updated 3 years ago
OpenMatch / UniVL-DR
View on GitHub
[ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…
☆52Jul 3, 2024Updated 2 years ago
XLearning-SCU / 2021-NeurIPS-NCR
View on GitHub
☆82Nov 6, 2023Updated 2 years ago
xinwei666 / MMGenerativeIR
View on GitHub
Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".
☆28Sep 15, 2025Updated 10 months ago
XLearning-SCU / 2024-TIP-CREAM
View on GitHub
PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)
☆22Mar 25, 2024Updated 2 years ago
Paranioar / Awesome_Matching_Pretraining_Transfering
View on GitHub
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…
☆446Sep 25, 2025Updated 9 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Xiaohui9607 / LLM_layout_generator
View on GitHub
LLM as Layout generator designed for improving compositional ability of stable diffusion models
☆17Dec 4, 2023Updated 2 years ago
shuanglinyan / CFine
View on GitHub
CLIP-Driven Fine-grained Text-Image Person Re-identification
☆67Nov 22, 2023Updated 2 years ago
Paranioar / SGRAF
View on GitHub
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
☆220Apr 11, 2024Updated 2 years ago
MartinYuanNJU / SEMScene
View on GitHub
Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval".
☆26Nov 13, 2024Updated last year
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
View on GitHub
☆22Sep 19, 2024Updated last year
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
AAA-Zheng / Image-Text-Matching-Summary
View on GitHub
Summary of Related Research on Image-Text Matching
☆75May 20, 2023Updated 3 years ago