llm-d-incubation / workload-variant-autoscaler

Variant optimization autoscaler for distributed inference workloads
21 Updated this week

Alternatives and similar repositories for workload-variant-autoscaler

Users who are interested in workload-variant-autoscaler are comparing it to the libraries listed below.
