AI-Hypercomputer / JetStream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future; PRs welcome).
275 · Updated this week

Alternatives and similar repositories for JetStream:

Users who are interested in JetStream are comparing it to the libraries listed below.