runpod-workers / worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
☆273Updated this week
Alternatives and similar repositories for worker-vllm:
Users that are interested in worker-vllm are comparing it to the libraries listed below
- A fast batching API to serve LLM models