LgQu/DIME

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LgQu/DIME)

LgQu / DIME

Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21

☆70

Alternatives and similar repositories for DIME

Users that are interested in DIME are comparing it to the libraries listed below

Sorting:

LgQu / CAMERA
View on GitHub
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
☆29May 26, 2022Updated 3 years ago
Shiyang-Yan / Discrete-continous-PG-for-Retrieval
View on GitHub
☆13Feb 1, 2022Updated 4 years ago
woodfrog / vse_infty
View on GitHub
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
☆164Aug 24, 2025Updated 6 months ago
Paranioar / SGRAF
View on GitHub
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
☆219Apr 11, 2024Updated last year
CrossmodalGroup / CMCAN
View on GitHub
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
☆36Jun 16, 2023Updated 2 years ago
zhangy0822 / USER
View on GitHub
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33Jun 18, 2025Updated 9 months ago
KunpengLi1994 / VSRN
View on GitHub
PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"
☆302Jan 14, 2020Updated 6 years ago
CrossmodalGroup / NAAF
View on GitHub
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Jun 19, 2023Updated 2 years ago
Paranioar / Awesome_Matching_Pretraining_Transfering
View on GitHub
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…
☆445Sep 25, 2025Updated 5 months ago
kuanghuei / SCAN
View on GitHub
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
☆579May 18, 2023Updated 2 years ago
yiling2018 / saem
View on GitHub
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41Sep 24, 2019Updated 6 years ago
HuiChen24 / IMRAM
View on GitHub
code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"
☆96Mar 8, 2020Updated 6 years ago
cyh-sj / CGMN
View on GitHub
The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…
☆46Jun 5, 2023Updated 2 years ago
hardyqr / HAL
View on GitHub
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Oct 4, 2023Updated 2 years ago
CrossmodalGroup / GSMN
View on GitHub
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
☆170Oct 12, 2020Updated 5 years ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 3 years ago
sunnychencool / AOQ
View on GitHub
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34Jul 2, 2020Updated 5 years ago
vkhoi / cora_cvpr24
View on GitHub
☆27Sep 3, 2024Updated last year
jwehrmann / retrieval.pytorch
View on GitHub
Adaptive Cross-Modal Embeddings for Image-Sentence Alignment
☆36Oct 3, 2023Updated 2 years ago
mesnico / TERAN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆75Dec 6, 2023Updated 2 years ago
LgQu / LeaPRR
View on GitHub
Learnable Pillar-based Re-ranking for Image-Text Retrieval. SIGIR'23
☆22Jul 31, 2023Updated 2 years ago
BruceW91 / CVSE
View on GitHub
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168Feb 7, 2022Updated 4 years ago
CrossmodalGroup / HREM
View on GitHub
Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023
☆92Apr 21, 2025Updated 11 months ago
AAA-Zheng / Image-Text-Matching-Summary
View on GitHub
Summary of Related Research on Image-Text Matching
☆74May 20, 2023Updated 2 years ago
kdwonn / DivE
View on GitHub
Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)
☆41Nov 15, 2023Updated 2 years ago
HuiChen24 / MM_SemanticConsistency
View on GitHub
code for our MM2019 paper “Cross-Modal Image-Text Retrieval with Semantic Consistency”
☆17Dec 7, 2019Updated 6 years ago
AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
ppanzx / CHAN
View on GitHub
☆53Sep 13, 2023Updated 2 years ago
CrossmodalGroup / BFAN
View on GitHub
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
☆39Jun 19, 2023Updated 2 years ago
ioanacroi / qb-norm
View on GitHub
Cross Modal Retrieval with Querybank Normalisation
☆57Nov 21, 2023Updated 2 years ago
CrossmodalGroup / ESL
View on GitHub
☆12May 3, 2024Updated last year
KevinLight831 / AMC
View on GitHub
[ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval
☆20Aug 30, 2024Updated last year
wenz116 / DRFT
View on GitHub
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18Oct 24, 2021Updated 4 years ago
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
VL-Group / 2022-NeurIPS-DAA
View on GitHub
The code of the paper of "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…
☆19Jan 16, 2024Updated 2 years ago
CrossmodalGroup / ER-SAN
View on GitHub
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
☆24Aug 5, 2023Updated 2 years ago
mesnico / TERN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
☆58Dec 6, 2023Updated 2 years ago
XLearning-SCU / 2021-NeurIPS-NCR
View on GitHub
☆82Nov 6, 2023Updated 2 years ago
ecom-research / ComposeAE
View on GitHub
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
☆56Oct 8, 2021Updated 4 years ago