TencentARC / UMT

UMT is a unified and flexible framework that handles different combinations of input modalities and outputs video moment retrieval and/or highlight detection results.
213 · Updated last year

Alternatives and similar repositories for UMT:

Users who are interested in UMT are comparing it to the libraries listed below: