Skyline-9 / Visionary-Vids

Multi-modal transformer approach for natural language query-based joint video summarization and highlight detection
13 · Updated 7 months ago

Alternatives and similar repositories for Visionary-Vids:

Users who are interested in Visionary-Vids are comparing it to the libraries listed below.