niluthpol / multimodal_vtt

Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
67Updated 4 years ago

Alternatives and similar repositories for multimodal_vtt:

Users that are interested in multimodal_vtt are comparing it to the libraries listed below