runpod-workers / worker-sglang
SGLang is fast serving framework for large language models and vision language models.
☆10Updated this week
Related projects ⓘ
Alternatives and complementary repositories for worker-sglang
- Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding