kubernetes-sigs / wg-servingLinks
WG Serving
☆27Updated last week
Alternatives and similar repositories for wg-serving
Users that are interested in wg-serving are comparing it to the libraries listed below
Sorting:
- GenAI inference performance benchmarking tool☆58Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆74Updated last month
- A CRD for arbitrary properties about a cluster☆34Updated last month
- Kubernetes Work API☆66Updated last month
- ☆52Updated last year
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆94Updated 2 months ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆38Updated this week
- Inference scheduler for llm-d☆56Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆67Updated last month
- Smart Kubernetes Scheduling☆79Updated this week
- ☆37Updated this week
- Kubernetes ClusterInventory API☆71Updated 3 months ago
- K8s device plugin for GPU sharing☆98Updated 2 years ago
- Cloud Native Artifacial Intelligence Model Format Specification☆63Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆236Updated last week
- ☆16Updated 3 months ago
- Operator for managing Node Feature Discovery deployment☆70Updated 3 weeks ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12Updated 2 years ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆75Updated 3 weeks ago
- ☆157Updated last week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆28Updated 6 months ago
- Shared library for use by volume populators.☆28Updated last month
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆114Updated this week
- Gateway API Inference Extension☆351Updated this week
- AppWrapper controller for Kueue☆14Updated last week
- CNI DRA Driver☆26Updated 5 months ago
- Minimum cluster registration and work☆58Updated 9 months ago
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆28Updated this week
- Libraries for implementing aggregated apiservers☆90Updated 2 months ago
- This repo contains sidecar controller and agent for volume health monitoring.☆68Updated last week