heldJan / X-VARS
X-VARS is a multi-modal large language model designed for understanding football videos from the point of view of a referee. X-VARS can perform a range of tasks, including video description, question answering, action recognition, and meaningful conversation based on video content.
23 · Jun 18, 2024 · Updated last year
