huizhang0110 / catvision

A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the performance of the open-source model Qwen-VL-7B-Chat.
14Updated last year

Alternatives and similar repositories for catvision:

Users that are interested in catvision are comparing it to the libraries listed below