niluthpol / multimodal_vtt

Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
69Updated 4 years ago

Related projects

Alternatives and complementary repositories for multimodal_vtt