mesolitica / multimodal-LLM

Multi-modal language modeling that integrates image, audio, and text, including multiple images and multiple audio clips in a single multi-turn conversation.
☆13 | Updated 8 months ago

Related projects

Alternatives and complementary repositories for multimodal-LLM