huizhang0110 / catvision

A large multimodal model that performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly outperforms the open-source Qwen-VL-7B-Chat.
14 · Feb 5, 2024 · Updated 2 years ago

Alternatives and similar repositories for catvision

Users who are interested in catvision are comparing it to the libraries listed below.
