wzhings / itmAFA

Implementation of "Enhancing Image-Text Matching with Adaptive Feature Aggregation" (ICASSP 2024).

☆9

Alternatives and similar repositories for itmAFA

Users who are interested in itmAFA are comparing it to the repositories listed below.

ZhangXu0963 / NPC
Code for the paper "Negative Pre-aware for Noisy Cross-modal Matching" (AAAI 2024).
☆22 Updated last month
lerogo / aaai24_itr_cusa
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆46 Updated last year
zhangy0822 / USER
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆32 Updated last month
QinYang79 / CRCL
Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)
☆16 Updated last year
taewhankim / VIPCAP
☆12 Updated 7 months ago
ppanzx / CHAN
☆49 Updated last year
vkhoi / cora_cvpr24
☆25 Updated 11 months ago
ZhangXu0963 / VSL
Code for "Image-text Retrieval via Preserving Main Semantic of Vision" (ICME 2023).
☆14 Updated last year
Paranioar / RCAR
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆33 Updated last year
CrossmodalGroup / ESL
☆12 Updated last year
QinYang79 / Awesome-Noisy-Correspondence
This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…
☆68 Updated 3 weeks ago
WayneTomas / TransCP
[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…
☆18 Updated 3 months ago
hhc1997 / L2RM
☆34 Updated last year
LuminosityX / HAT
Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'
☆25 Updated last year
ZiChao111 / FTI4CIR
Code for the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
☆25 Updated 4 months ago
leolee99 / PAU
The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS…
☆25 Updated last year
musicman217 / Text-Proxy
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆13 Updated 3 weeks ago
haokunwen / DQU-CIR
[SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
☆40 Updated last year
AAA-Zheng / Image-Text-Matching-Summary
Summary of Related Research on Image-Text Matching
☆70 Updated 2 years ago
gaojingsheng / LAMM
Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024
☆32 Updated last year
DarrenZZhang / MM23-MITH
☆19 Updated last year
KevinLight831 / AMC
[ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval
☆20 Updated 11 months ago
LCFractal / TGDT
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
☆27 Updated 2 years ago
chunmeifeng / SPRC
[ICLR 2024, Spotlight] Sentence-level Prompts Benefit Composed Image Retrieval
☆85 Updated last year
yl3800 / TranSTR
☆12 Updated last year
Ziyang412 / UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
☆65 Updated last year
hhc1997 / MSCN
☆10 Updated last year
lntzm / MESM
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆30 Updated last year
Delong-liu-bupt / Composed_Person_Retrieval
Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databas…
☆25 Updated 2 months ago
PKU-ICST-MIPL / MKVSE-TOMM2023
☆27 Updated 2 years ago