rsreetech / MultiModalSearch

In this repository I demonstrate how you can perform multimodal(image+text) search to find similar images+texts given a test image+text from a multimodal (texts+images) database . I use the Kaggle Shopee dataset. I use Tensorflow MobileNet CNN and hugging face sentence transformers BERT to extract image and text embeddings to create a joint embe…
12Updated 3 years ago

Related projects

Alternatives and complementary repositories for MultiModalSearch