gpustack / llama-box

LLM inference server implementation based on llama.cpp.

Related projects

Alternatives and complementary repositories for llama-box