mesnico/ALADIN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mesnico/ALADIN)

mesnico / ALADIN

Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"

☆28

Alternatives and similar repositories for ALADIN

Users that are interested in ALADIN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
mesnico / TERAN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆74Dec 6, 2023Updated 2 years ago
CrossmodalGroup / CMCAN
View on GitHub
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
☆36Jun 16, 2023Updated 3 years ago
GabrieleLagani / HebbianLearning
View on GitHub
Pytorch implementation of Hebbian learning algorithms to train deep convolutional neural networks.
☆27Jul 2, 2024Updated 2 years ago
lerogo / aaai24_itr_cusa
View on GitHub
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆55Mar 28, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yiling2018 / saem
View on GitHub
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41Sep 24, 2019Updated 6 years ago
hardyqr / HAL
View on GitHub
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Oct 4, 2023Updated 2 years ago
Shiyang-Yan / Discrete-continous-PG-for-Retrieval
View on GitHub
☆13Feb 1, 2022Updated 4 years ago
PKU-ICST-MIPL / MKVSE-TOMM2023
View on GitHub
☆28May 16, 2023Updated 3 years ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 4 years ago
AAA-Zheng / Image-Text-Matching-Summary
View on GitHub
Summary of Related Research on Image-Text Matching
☆75May 20, 2023Updated 3 years ago
CrossmodalGroup / NAAF
View on GitHub
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Jun 19, 2023Updated 3 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
ppanzx / CHAN
View on GitHub
☆54Sep 13, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
VITA-Group / ProgressiveDD
View on GitHub
[ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…
☆15May 18, 2024Updated 2 years ago
LDXDU / FedFusion-TGRS-2024
View on GitHub
☆14Sep 27, 2025Updated 9 months ago
BruceW91 / CVSE
View on GitHub
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168Feb 7, 2022Updated 4 years ago
officialarijit / DFL
View on GitHub
A Docker-Based Federated Learning Framework Design and Deployment for Multi-modal Data Stream Classification
☆13Feb 15, 2024Updated 2 years ago
CEA-LIST / SCE
View on GitHub
Implementation of "Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning" WACV 2023.
☆26Sep 6, 2023Updated 2 years ago
Ruggero1912 / Patch-ioner
View on GitHub
[CVPR 2026] Official Repository of the Paper "One Patch to Caption Them All A Unified Zero-Shot Captioning Framework"
☆15Jun 4, 2026Updated last month
WuXinglong-HIT / CLIPER
View on GitHub
☆12Jul 7, 2024Updated 2 years ago
zhouyu1996 / DAQN
View on GitHub
An implement of our paper “DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL”
☆10May 16, 2021Updated 5 years ago
LinLLLL / BayesCAL
View on GitHub
The official implementation of Bayesian Cross-modal Alignment Learning for Few-Shot Out-of-Distribution Generalization (AAAI2023).
☆12Oct 13, 2025Updated 9 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
hhc1997 / MSCN
View on GitHub
☆12Mar 28, 2024Updated 2 years ago
mesnico / WifiIndoorLocation
View on GitHub
Android application that localizes people in indoor environments, using wifi fingerprinting methods
☆18Sep 30, 2017Updated 8 years ago
BenjaminTMilnes / ManchuDictionary
View on GitHub
A Manchu dictionary website
☆12Feb 26, 2026Updated 4 months ago
cyh-sj / CGMN
View on GitHub
The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…
☆45Jun 5, 2023Updated 3 years ago
andreineculai / MPC
View on GitHub
☆25May 11, 2022Updated 4 years ago
imguangyu / FedPerfix
View on GitHub
Official repository for FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning (ICCV2023)
☆20Dec 1, 2023Updated 2 years ago
Saehyung-Lee / PlugIR
View on GitHub
Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)
☆34Mar 24, 2025Updated last year
ZihaoWang-CV / CAMP_iccv19
View on GitHub
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
☆126Feb 26, 2020Updated 6 years ago
BonnieHuangxin / SLTA
View on GitHub
ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》
☆36Jun 19, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wenzhifang / Federated-Sketching-LoRA-Implementation
View on GitHub
☆28May 21, 2025Updated last year
GingL / CMPA
View on GitHub
☆16May 31, 2023Updated 3 years ago
xinghaow99 / DenoSent
View on GitHub
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15Apr 29, 2024Updated 2 years ago
JiangXinni / D2A2
View on GitHub
Official PyTorch implementation of "The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignm…
☆20Dec 9, 2024Updated last year
mcahny / rovit
View on GitHub
RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"
☆17Aug 24, 2023Updated 2 years ago
OverflowCat / Manchuly
View on GitHub
A Manchu dictionary bot for Telegram.
☆11Feb 14, 2021Updated 5 years ago
ZhangXu0963 / VSL
View on GitHub
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15Dec 25, 2023Updated 2 years ago