Wangt-CN / Image-text-matching
An building code for a new framework in image-text matching task
☆11Updated 5 years ago
Alternatives and similar repositories for Image-text-matching:
Users that are interested in Image-text-matching are comparing it to the libraries listed below
- A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.☆18Updated 5 years ago
- image captioning paper list☆8Updated 5 years ago
- ☆14Updated 5 years ago
- ☆18Updated last year
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆37Updated last year
- Phrase Localization Evaluation Toolkit☆19Updated 5 years ago
- The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral☆68Updated 5 years ago
- ☆11Updated 2 years ago
- ☆19Updated 2 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features☆12Updated 3 years ago
- This is an implementation of image caption, based on two different papers. The two papers are: 1. Show and Tell: A Neural Image Caption G…☆30Updated 5 years ago
- A large scale dataset for Video Captioning in Italian☆12Updated last year
- ☆10Updated last year
- Published in CVPR 2020; matlab codes☆22Updated 5 months ago
- Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144☆57Updated last year
- Deep Cross-Modal Projection Learning for Image-Text Matching☆74Updated 4 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated 2 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering☆41Updated 4 years ago
- Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning☆82Updated 3 years ago
- ☆24Updated 2 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Updated 3 years ago
- UniVSE implementation on Python3☆10Updated 4 years ago
- A paper list of visual semantic embeddings and text-image retrieval.☆41Updated 4 years ago
- ☆11Updated 6 years ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13Updated last year
- A length-controllable and non-autoregressive image captioning model.☆68Updated 3 years ago
- Official codes for paper "Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning".☆13Updated 3 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆30Updated last year
- PyTorch implementation of Data-Efficient Image Recognition with Contrastive Predictive Coding☆13Updated 4 years ago