microsoft / sarathi-serveView on GitHub
A low-latency & high-throughput serving engine for LLMs
480Jan 8, 2026Updated last month

Alternatives and similar repositories for sarathi-serve

Users that are interested in sarathi-serve are comparing it to the libraries listed below

Sorting:

Are these results useful?