fly-dragon211 / tth
Source code of our ICASSP2023 paper: Towards Making a Trojan-horse Attack on Text-to-Image Retrieval.
☆9Updated last year
Related projects ⓘ
Alternatives and complementary repositories for tth
- ☆34Updated 2 years ago
- ☆63Updated last year
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆39Updated 7 months ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆18Updated 9 months ago
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))☆145Updated 2 years ago
- Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval ( ACM Multimedia 2022, Pytorch Code)☆41Updated 8 months ago
- FedCMR: Federated Cross-Modal Retrieval 的代码(the official implementation of FedCMR: Federated Cross-Modal Retrieval)☆11Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆44Updated last month
- Repository for an end-to-end image captioning method PTSN(ACM MM22).☆60Updated last year
- CCD: Official PyTorch implementation of the paper "Contextual Debiasing for Visual Recognition with Causal Mechanisms"☆16Updated last year
- ☆25Updated last year
- ☆11Updated last month
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆108Updated last year
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆31Updated 7 months ago
- Summary of Related Research on Image-Text Matching☆67Updated last year
- ☆10Updated 4 months ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆45Updated 2 years ago
- This is the source code for Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score (ICML2023).☆33Updated last month
- Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"☆73Updated last year
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆66Updated 2 years ago
- NegCLIP.☆26Updated last year
- ☆17Updated 4 months ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆30Updated last year
- This repo contains code for Invariant Grounding for Video Question Answering☆26Updated last year
- ☆34Updated last year
- ☆89Updated last year
- [CVPR 2024] Official repository of the paper "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Vid…☆33Updated 2 weeks ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆106Updated 2 years ago
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS)☆49Updated last year