abgulati / kosmos-2_5-containerized

Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-script based invocation make it difficult to use for application development. Here, it has been containerized and made available via an API, greatly enhancing its ease-of-use.
57Updated 5 months ago

Alternatives and similar repositories for kosmos-2_5-containerized:

Users that are interested in kosmos-2_5-containerized are comparing it to the libraries listed below