llm-d-incubation / workload-variant-autoscalerLinks

Variant optimization autoscaler for distributed inference workloads

☆21

Alternatives and similar repositories for workload-variant-autoscaler

Users that are interested in workload-variant-autoscaler are comparing it to the libraries listed below

Sorting:

llm-d / llm-d-benchmark
llm-d benchmark scripts and tooling
☆33Updated this week
llm-d / llm-d-inference-scheduler
Inference scheduler for llm-d
☆105Updated this week
containers / nri-plugins
A collection of community maintained NRI plugins
☆97Updated last week
llm-d / llm-d-inference-sim
A light weight vLLM simulator, for mocking out replicas.
☆58Updated this week
cncf-tags / container-device-interface
☆268Updated this week
kubernetes-sigs / inference-perf
GenAI inference performance benchmarking tool
☆123Updated last week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆281Updated this week
intel / intel-resource-drivers-for-kubernetes
☆32Updated last week
kubernetes-sigs / dra-example-driver
Example DRA driver that developers can fork and modify to get them started writing their own.
☆105Updated 3 weeks ago
intel / platform-aware-scheduling
Enabling Kubernetes to make pod placement decisions with platform intelligence.
☆176Updated 9 months ago
NVIDIA / topograph
A toolkit for discovering cluster network topology.
☆83Updated this week
NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆136Updated this week
modelpack / model-spec
Cloud Native Artifacial Intelligence Model Format Specification
☆141Updated this week
project-codeflare / multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
☆116Updated last year
checkpoint-restore / checkpoint-restore-operator
☆30Updated 2 months ago
kubernetes-sigs / kernel-module-management
The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.
☆109Updated last week
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆71Updated 4 months ago
kubernetes-sigs / wg-serving
WG Serving
☆31Updated last month
run-ai / fake-gpu-operator
☆162Updated 3 weeks ago
llm-d-incubation / llm-d-infra
llm-d helm charts and deployment examples
☆46Updated last month
openshift / instaslice-operator
InstaSlice Operator facilitates slicing of accelerators using stable APIs
☆47Updated this week
intel / cri-resource-manager
Kubernetes Container Runtime Interface proxy service with hardware resource aware workload placement policies
☆177Updated 4 months ago
llm-d / llm-d-model-service
Simplified model deployment on llm-d
☆27Updated 4 months ago
containerd / nri
Node Resource Interface
☆342Updated this week
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆524Updated last week
kubernetes-sigs / cluster-api-provider-kubemark
CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.
☆82Updated last week
intel / kubernetes-power-manager
☆87Updated last year
openshift / sriov-network-operator
SR-IOV Network Operator
☆144Updated this week
kubernetes-sigs / cni-dra-driver
CNI DRA Driver
☆31Updated last month
BaizeAI / kcover
🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.
☆33Updated this week