runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
244Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for worker-vllm