LCFractal / TGDT
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TGDT
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'☆20Updated 11 months ago
- ☆12Updated 6 months ago
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.☆13Updated 11 months ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆21Updated 8 months ago
- ☆23Updated last year
- ☆41Updated last year
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Updated last year
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆29Updated 7 months ago
- Summary of Related Research on Image-Text Matching☆67Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆26Updated 7 months ago
- Cross-Modal-Real-valuded-Retrieval☆76Updated last year
- The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…☆41Updated last year
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆66Updated 2 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆30Updated 2 years ago
- ☆10Updated last year
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆111Updated last year
- ☆17Updated 7 months ago
- Local self-attention in Transformer for visual question answering☆12Updated 8 months ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆156Updated last year
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆19Updated 6 months ago
- PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-…☆36Updated 2 years ago
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆14Updated 5 months ago
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆11Updated last year
- [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”☆214Updated 7 months ago
- Source codes of the paper "When CLIP meets Cross-modal Hashing Retrieval: A New Strong Baseline"☆24Updated 8 months ago
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆67Updated 5 months ago
- Cross-Modal Retrieval with Partially Mismatched Pairs (IEEE TPAMI 2023, PyTorch Code)☆18Updated last year
- ☆14Updated last year
- source code for "Deep adversarial discrete hashing for cross-modal retrieval"☆24Updated last year
- Repository for an end-to-end image captioning method PTSN(ACM MM22).☆60Updated last year