heldJan / X-VARS

X-VARS is a multi-modal large language model designed for understanding football videos from the point of view of a referee. X-VARS can perform a multitude of tasks, including video description, question answering, action recognition, and conducting meaningful conversations based on video content.
13Updated 4 months ago

Related projects

Alternatives and complementary repositories for X-VARS