ictnlp / LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
302Updated 2 weeks ago

Alternatives and similar repositories for LLaVA-Mini:

Users that are interested in LLaVA-Mini are comparing it to the libraries listed below