bytedance / tarsier
Tarsier is a family of large-scale video-language models designed to generate high-quality video descriptions, with good general video understanding capability.
547 · Aug 14, 2025 · Updated 10 months ago

Alternatives and similar repositories for tarsier

Users who are interested in tarsier are comparing it to the libraries listed below.

