dwarvesf / llm-hosting

This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β˜†16Updated 3 weeks ago

Related projects β“˜

Alternatives and complementary repositories for llm-hosting