Xiaodongsuper / SCALE_code
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining CVPR 2022
☆34Updated 2 years ago
Alternatives and similar repositories for SCALE_code
Users who are interested in SCALE_code are comparing it to the repositories listed below
- M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining CVPR 2022. Dataset toolkit☆24Updated 3 years ago
- ☆15Updated 2 years ago
- M5Product Main Page.☆14Updated 3 years ago
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆115Updated last year
- A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval☆42Updated 3 years ago
- Product1M☆87Updated 2 years ago
- Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval (ACM Multimedia 2022, PyTorch Code)☆43Updated last year
- ☆74Updated last year
- This repo holds the PyTorch code and models for the BTH framework presented at CVPR 2021☆33Updated 3 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated 2 years ago
- The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…☆45Updated last year
- ☆36Updated 2 years ago
- Learning Cross-Modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)☆53Updated 2 years ago
- Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching☆38Updated last year
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆71Updated 2 years ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆83Updated last year
- Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.☆72Updated 2 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆161Updated 2 years ago
- https://layer6ai-labs.github.io/xpool/☆124Updated last year
- The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)☆16Updated 2 years ago
- Summary of Related Research on Image-Text Matching☆70Updated last year
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The…☆67Updated last month
- ☆11Updated 2 years ago
- ☆27Updated last year
- ☆44Updated last year
- The code for "Semi-Supervised Cross-Modal Hashing with Multi-view Graph Representation"☆10Updated 4 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆262Updated 7 months ago
- Xiaodongsuper / Adaptive-Collaborative-Similarity-Learning-for-Unsupervised-Multi-view-Feature-Selection: Adaptive Collaborative Similarity Learning for Unsupervised Multi-view Feature Selection, IJCAI 2018☆59Updated 5 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆63Updated last month
- The code for the paper "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…☆19Updated last year