Wangt-CN / Code_CASCLinks

☆14

Alternatives and similar repositories for Code_CASC

Users that are interested in Code_CASC are comparing it to the libraries listed below

Sorting:

zhixiongz / CLIP4CMR
A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval
☆42Updated 3 years ago
LgQu / DIME
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
☆71Updated 3 years ago
woodfrog / vse_infty
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
☆160Updated 2 years ago
layer6ai-labs / xpool
https://layer6ai-labs.github.io/xpool/
☆125Updated 2 years ago
CrossmodalGroup / NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆118Updated 2 years ago
ioanacroi / qb-norm
Cross Modal Retrieval with Querybank Normalisation
☆55Updated last year
XLearning-SCU / 2021-NeurIPS-NCR
☆74Updated last year
AAA-Zheng / Listwise_ITR
Official PyTorch implementation of the paper "Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval"
☆8Updated 2 years ago
CrossmodalGroup / BFAN
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
☆38Updated 2 years ago
fortunechen / paper-reading_CrossModelGroup-USTC
中科大跨模态智能组-每周论文分享
☆16Updated 2 years ago
m2man / LGSGM
☆34Updated 3 years ago
kywen1119 / DSRAN
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
☆72Updated 2 years ago
Paranioar / SGRAF
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
☆217Updated last year
mesnico / TERAN
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆74Updated last year
foolwood / DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
☆96Updated 3 years ago
baiyang4 / D-LSG-Video-Caption
☆27Updated 3 years ago
Huntersxsx / TSGV-Learning-List
Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作
☆29Updated 3 years ago
ycWang9725 / WSTAN
☆16Updated 3 years ago
LiuRicky / ts2_net
[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
☆77Updated 2 years ago
Roc-Ng / HANet
PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).
☆47Updated 3 years ago
CrossmodalGroup / ER-SAN
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
☆23Updated 2 years ago
cyh-sj / CGMN
The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…
☆45Updated 2 years ago
CrossmodalGroup / GSMN
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
☆167Updated 4 years ago
bladewaltz1 / PromptSwitch
☆30Updated last year
QinYang79 / DECL
Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval ( ACM Multimedia 2022, Pytorch Code)
☆44Updated last year
HuiGuanLab / ms-sl
Source code of our MM'22 paper Partially Relevant Video Retrieval
☆54Updated 9 months ago
doc-doc / HQGA
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
☆34Updated 2 years ago
boheumd / A2Summ
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
☆78Updated 2 years ago
MCG-NJU / MMN
[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
☆90Updated 2 years ago
bofang98 / UATVR
[ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval
☆13Updated last year