pepijndevos / llama_multiserver

A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM
12Updated last month

Alternatives and similar repositories for llama_multiserver:

Users that are interested in llama_multiserver are comparing it to the libraries listed below