llm-d / llm-d-model-serviceLinks
Simplified model deployment on llm-d
☆23Updated 2 weeks ago
Alternatives and similar repositories for llm-d-model-service
Users that are interested in llm-d-model-service are comparing it to the libraries listed below
Sorting:
- Inference scheduler for llm-d☆56Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆74Updated 3 weeks ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆38Updated this week
- Holistic job manager on Kubernetes☆116Updated last year
- GenAI inference performance benchmarking tool☆55Updated 2 weeks ago
- ☆55Updated this week
- RukPak runs in a Kubernetes cluster and defines APIs for installing cloud native content☆51Updated 10 months ago
- open-cluster-management governance material.☆64Updated 3 months ago
- Operator for managing Node Feature Discovery deployment☆70Updated 3 weeks ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆129Updated this week
- Kubernetes Work API☆66Updated last month
- A benchmarking tool to evaluate Knative performance☆38Updated last year
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆94Updated 2 months ago
- A template project for writing your own controller using the Knative helper libraries.☆73Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆28Updated 6 months ago
- AppWrapper controller for Kueue☆14Updated last week
- hub / spoke registration controllers☆42Updated 8 months ago
- ☆37Updated this week
- Helm charts for llm-d☆42Updated this week
- KJob: Tool for CLI-loving ML researchers☆30Updated last week
- Cloud Native Artifacial Intelligence Model Format Specification☆63Updated this week
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆103Updated last week
- Distributed KV cache coordinator☆34Updated 2 weeks ago
- KubeStellar - a flexible solution for multi-cluster configuration management for edge, multi-cloud, and hybrid cloud☆413Updated this week
- WG Serving☆27Updated last week
- CSI driver to bootstrap COSI workloads☆18Updated 2 years ago
- Libraries for implementing aggregated apiservers☆90Updated 2 months ago
- A collection of community maintained NRI plugins☆82Updated this week
- Container Object Storage Interface (COSI) API responsible to define API for COSI objects. NOTE: The content of this repo has been moved t…☆70Updated 6 months ago
- Generates Kubernetes CRD API reference documentation☆133Updated last week