TencentARC / UMT

UMT is a unified and flexible framework that handles different combinations of input modalities and outputs video moment retrieval and/or highlight detection results.
192 · Updated 7 months ago

Related projects

Alternatives and complementary repositories for UMT