heldJan / X-VARS

X-VARS is a multi-modal large language model designed for understanding football videos from the point of view of a referee. X-VARS can perform a multitude of tasks, including video description, question answering, action recognition, and conducting meaningful conversations based on video content.
14Updated 8 months ago

Alternatives and similar repositories for X-VARS:

Users that are interested in X-VARS are comparing it to the libraries listed below