huizhang0110 / catvision

A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the performance of the open-source model Qwen-VL-7B-Chat.
β˜†14Updated 9 months ago

Related projects β“˜

Alternatives and complementary repositories for catvision