Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
☆28Dec 6, 2023Updated 2 years ago
Alternatives and similar repositories for ALADIN
Users that are interested in ALADIN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…☆74Dec 6, 2023Updated 2 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 3 years ago
- ☆47Jan 14, 2026Updated 2 months ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)☆60Mar 1, 2025Updated last year
- Code to reproduce 'Combining GANs and AutoEncoders for efficient anomaly detection'☆40Nov 2, 2021Updated 4 years ago
- ☆13Feb 1, 2022Updated 4 years ago
- Paper reading notes in the field of Image-Text Matching/Retrieval.☆13Mar 25, 2022Updated 4 years ago
- ☆28May 16, 2023Updated 2 years ago
- Summary of Related Research on Image-Text Matching☆74May 20, 2023Updated 2 years ago
- ☆15Apr 30, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆53Sep 13, 2023Updated 2 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated last year
- Generative label fused network for image–text matching☆10Jan 13, 2023Updated 3 years ago
- Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching☆39Jun 19, 2023Updated 2 years ago
- Implementation of "Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning" WACV 2023.☆26Sep 6, 2023Updated 2 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- ☆10Apr 7, 2024Updated last year
- ☆12Jul 7, 2024Updated last year
- ☆14Sep 27, 2025Updated 5 months ago
- A Docker-Based Federated Learning Framework Design and Deployment for Multi-modal Data Stream Classification☆13Feb 15, 2024Updated 2 years ago
- The official implementation of Bayesian Cross-modal Alignment Learning for Few-Shot Out-of-Distribution Generalization (AAAI2023).☆20Oct 13, 2025Updated 5 months ago
- ☆14Oct 14, 2019Updated 6 years ago
- ☆13Mar 28, 2024Updated last year
- Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.☆74Oct 25, 2022Updated 3 years ago
- The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…☆46Jun 5, 2023Updated 2 years ago
- ☆25May 11, 2022Updated 3 years ago
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)☆138Mar 1, 2024Updated 2 years ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Jun 18, 2025Updated 9 months ago
- ☆34Jun 14, 2022Updated 3 years ago
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆35Mar 24, 2025Updated last year
- Stacked attention network for answering open-ended questions about image☆12May 31, 2018Updated 7 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago
- RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval (CVPR 2023, PyTorch Code)☆21Mar 11, 2024Updated 2 years ago
- ☆17May 31, 2023Updated 2 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning☆15Apr 29, 2024Updated last year