asprenger / ray_vllm_inference

A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
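Below is a minimal sketch, not the repository's actual code, of how a vLLM engine could be wrapped in a Ray Serve deployment to get an HTTP endpoint. The model name, request schema, and resource settings are illustrative assumptions.

```python
from ray import serve
from starlette.requests import Request
from vllm import LLM, SamplingParams


@serve.deployment(num_replicas=1, ray_actor_options={"num_gpus": 1})
class VLLMDeployment:
    def __init__(self):
        # Load the model once per replica; the model name is an example.
        self.llm = LLM(model="facebook/opt-125m")

    async def __call__(self, request: Request) -> dict:
        # Assumed JSON body like {"prompt": "...", "max_tokens": 64}.
        body = await request.json()
        params = SamplingParams(
            temperature=body.get("temperature", 0.7),
            max_tokens=body.get("max_tokens", 64),
        )
        # Note: llm.generate() is blocking; a production service would
        # typically use vLLM's async engine instead.
        outputs = self.llm.generate([body["prompt"]], params)
        return {"text": outputs[0].outputs[0].text}


# Bind the deployment; serve.run exposes it over HTTP (default port 8000).
app = VLLMDeployment.bind()

if __name__ == "__main__":
    serve.run(app)
```

A client could then POST a JSON prompt to the Serve endpoint and receive the generated text in the response.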
60 stars · Updated 9 months ago

Alternatives and similar repositories for ray_vllm_inference: