PKU-ICST-MIPL/MKVSE-TOMM2023

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PKU-ICST-MIPL/MKVSE-TOMM2023)

PKU-ICST-MIPL / MKVSE-TOMM2023

☆28

Alternatives and similar repositories for MKVSE-TOMM2023

Users that are interested in MKVSE-TOMM2023 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BruceW91 / CVSE
View on GitHub
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168Feb 7, 2022Updated 4 years ago
PKU-ICST-MIPL / MARS_TCSVT2021
View on GitHub
☆12Feb 2, 2023Updated 3 years ago
CrossmodalGroup / NAAF
View on GitHub
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Jun 19, 2023Updated 3 years ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 4 years ago
vkhoi / cora_cvpr24
View on GitHub
☆28Sep 3, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
AAA-Zheng / Image-Text-Matching-Summary
View on GitHub
Summary of Related Research on Image-Text Matching
☆75May 20, 2023Updated 3 years ago
zhangy0822 / USER
View on GitHub
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33Jun 18, 2025Updated last year
GingL / CMPA
View on GitHub
☆16May 31, 2023Updated 3 years ago
CrossmodalGroup / ESL
View on GitHub
☆12May 3, 2024Updated 2 years ago
QinYang79 / CRCL
View on GitHub
Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)
☆15Jun 6, 2024Updated 2 years ago
Paranioar / SGRAF
View on GitHub
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
☆219Apr 11, 2024Updated 2 years ago
AndersonStra / Mucko
View on GitHub
implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10Mar 17, 2022Updated 4 years ago
CrossmodalGroup / CMCAN
View on GitHub
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
☆36Jun 16, 2023Updated 3 years ago
dingdanhao110 / Conch
View on GitHub
☆11Jan 24, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
KevinLight831 / ESA
View on GitHub
[TCSVT2023] - ESA: External Space Attention Aggregation for Image-Text Retrieval
☆23Aug 30, 2024Updated last year
CrossmodalGroup / GSMN
View on GitHub
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
☆170Oct 12, 2020Updated 5 years ago
zjukg / DUET
View on GitHub
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
☆54Feb 9, 2024Updated 2 years ago
xinwei666 / MMGenerativeIR
View on GitHub
Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".
☆28Sep 15, 2025Updated 10 months ago
mesnico / ALADIN
View on GitHub
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
☆28Dec 6, 2023Updated 2 years ago
LgQu / CAMERA
View on GitHub
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
☆29May 26, 2022Updated 4 years ago
OreOZhao / CMR
View on GitHub
Code for "Contrast then Memorize: Semantic Neighbor Retrieval-Enhanced Inductive Multimodal Knowledge Graph Completion", SIGIR 2024.
☆15Feb 20, 2025Updated last year
cyh-sj / CGMN
View on GitHub
The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…
☆45Jun 5, 2023Updated 3 years ago
kuanghuei / SCAN
View on GitHub
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
☆579May 18, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Paranioar / Awesome_Matching_Pretraining_Transfering
View on GitHub
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…
☆446Sep 25, 2025Updated 10 months ago
mesnico / TERAN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆74Dec 6, 2023Updated 2 years ago
YuanLi95 / KECPM
View on GitHub
Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))
☆12Aug 27, 2024Updated last year
alex-bogatu / d3l
View on GitHub
D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf
☆21Nov 18, 2021Updated 4 years ago
LCFractal / TGDT
View on GitHub
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
☆30Jun 20, 2023Updated 3 years ago
coldmanck / RVL-BERT
View on GitHub
The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…
☆18Oct 21, 2022Updated 3 years ago
cwj1412 / MSCOCO-Flikcr30K_FG
View on GitHub
Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)
☆28Apr 24, 2023Updated 3 years ago
informagi / GEEER
View on GitHub
Code supporting the paper Graph-Embedding Empowered Entity Retrieval
☆24Apr 11, 2025Updated last year
iLearn-Lab / SIGIR21-DIME
View on GitHub
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
☆68Apr 5, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
PRIS-CV / An-Erudite-FGVC-Model
View on GitHub
Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)"
☆17Jun 2, 2023Updated 3 years ago
Shiyang-Yan / Discrete-continous-PG-for-Retrieval
View on GitHub
☆13Feb 1, 2022Updated 4 years ago
VinitSR7 / Image-Caption-Generation
View on GitHub
Image Captioning: Implementing the Neural Image Caption Generator
☆21Oct 14, 2020Updated 5 years ago
miccunifi / Cross-the-Gap
View on GitHub
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
☆70Nov 30, 2025Updated 7 months ago
teanalab / kewer
View on GitHub
Knowledge graph Entity and Word Embeddings for Retrieval
☆11Nov 19, 2021Updated 4 years ago
fartashf / vsepp
View on GitHub
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
☆523Dec 8, 2021Updated 4 years ago
sam1016yu / DB-Exp-Sensitivity
View on GitHub
A Study of Database Performance Sensitivity to Experiment Settings
☆11May 31, 2022Updated 4 years ago