Pyligent / Fashion-Image-Text-Multimodal-retrieval
Joint Image and textual feature Fashion Style search
☆11 Updated 4 years ago
Related projects
Alternatives and complementary repositories for Fashion-Image-Text-Multimodal-retrieval
- TensorFlow implementation of the paper [CVPR 2020] Image Search with Text Feedback by Visiolinguistic Attention Learning ☆63 Updated 4 years ago
- Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery." ☆63 Updated 2 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images ☆57 Updated 5 years ago
- ☆20 Updated 5 years ago
- ☆16 Updated 3 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020 ☆81 Updated 4 years ago
- Deep Cross-Modal Projection Learning for Image-Text Matching ☆73 Updated 4 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset ☆34 Updated 4 years ago
- Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval ☆56 Updated 3 years ago
- This repository contains an implementation of the models introduced in the paper Dialog-based Interactive Image Retrieval. The network is… ☆68 Updated 4 years ago
- Word2VisualVec: Predicting Visual Features from Text for Image and Video Caption Retrieval ☆69 Updated 4 years ago
- Ad-hoc Video Search ☆27 Updated 3 years ago
- ☆27 Updated 4 years ago
- CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval ☆127 Updated 4 years ago
- A multimodal embedding of images and captions, built with PyTorch, written with Python3. ☆29 Updated 5 years ago
- The official code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral ☆67 Updated 5 years ago
- ☆10 Updated 5 years ago
- The code of the paper, FashionNet. Using Keras, on Jupyter ☆14 Updated 5 years ago
- This document covers various papers about multi-label for reference ☆25 Updated 4 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models ☆19 Updated 4 years ago
- ☆15 Updated 7 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs". ☆37 Updated last year
- ☆62 Updated 2 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction ☆20 Updated last year
- Modality-Agnostic Attention Fusion for visual search with text feedback ☆25 Updated last year
- Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019) ☆134 Updated 8 months ago
- Extended Annotations of DeepFashion Images for Fine-grained Recognition ☆13 Updated 5 years ago
- The TensorFlow code for Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval ☆37 Updated 6 years ago
- Code for our CVPR 2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval" ☆92 Updated 4 years ago