VL-Group / 2022-NeurIPS-DAA
The code of the paper of "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted by NeurIPS' 2022.
☆19Updated last year
Alternatives and similar repositories for 2022-NeurIPS-DAA:
Users that are interested in 2022-NeurIPS-DAA are comparing it to the libraries listed below
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆22Updated 11 months ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆71Updated 2 years ago
- ☆74Updated last year
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.☆14Updated last year
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆31Updated last year
- ☆13Updated last year
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆12Updated 5 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆62Updated last month
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆16Updated 11 months ago
- Repository for an end-to-end image captioning method PTSN(ACM MM22).☆61Updated 2 years ago
- Cross-Modal-Real-valuded-Retrieval☆81Updated last year
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆35Updated 9 months ago
- Official PyTorch implementation of the paper "Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval"☆9Updated last year
- ☆18Updated last year
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆82Updated last year
- Awesome Vision-Language Pretraining Papers☆30Updated 3 months ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29Updated 2 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Updated last year
- A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval☆42Updated 3 years ago
- [TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”☆46Updated last year
- Cross-Modal Retrieval with Partially Mismatched Pairs (IEEE TPAMI 2023, PyTorch Code)☆20Updated last year
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆41Updated last year
- ☆19Updated 9 months ago
- The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)☆16Updated 2 years ago
- Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval ( ACM Multimedia 2022, Pytorch Code)☆43Updated last year
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆35Updated 7 months ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆14Updated last year
- ☆25Updated 8 months ago
- ☆47Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆41Updated last year