phospho-app / fastassert

Dockerized LLM inference server with constrained output (JSON mode), built on top of vLLM and outlines. Faster, cheaper and without rate limits. Compare the quality and latency to your current LLM API provider.
27Updated 11 months ago

Alternatives and similar repositories for fastassert:

Users that are interested in fastassert are comparing it to the libraries listed below