ictnlp / LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
474Updated 4 months ago

Alternatives and similar repositories for LLaVA-Mini

Users that are interested in LLaVA-Mini are comparing it to the libraries listed below

Sorting: