niluthpol / multimodal_vttView on GitHub
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
68Apr 10, 2020Updated 5 years ago

Alternatives and similar repositories for multimodal_vtt

Users that are interested in multimodal_vtt are comparing it to the libraries listed below

Sorting:

Are these results useful?