kubernetes-sigs / wg-servingLinks
WG Serving
☆31Updated last month
Alternatives and similar repositories for wg-serving
Users that are interested in wg-serving are comparing it to the libraries listed below
Sorting:
- GenAI inference performance benchmarking tool☆133Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆140Updated last week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆109Updated last month
- Kubernetes Work API☆68Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆282Updated last week
- K8s device plugin for GPU sharing☆99Updated 2 years ago
- llm-d helm charts and deployment examples☆46Updated last week
- ☆182Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆72Updated 4 months ago
- Cloud Native Artifacial Intelligence Model Format Specification☆144Updated last week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆47Updated last week
- Holistic job manager on Kubernetes☆115Updated last year
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆95Updated 7 months ago
- Inference scheduler for llm-d☆106Updated this week
- Smart Kubernetes Scheduling☆81Updated last week
- CNI DRA Driver☆31Updated 2 months ago
- Operator for managing Node Feature Discovery deployment☆73Updated 3 months ago
- A CRD for arbitrary properties about a cluster☆37Updated 2 months ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Updated 4 months ago
- ☆38Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆50Updated last week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆85Updated last week
- ☆116Updated 3 weeks ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆154Updated this week
- KJob: Tool for CLI-loving ML researchers☆39Updated last week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12Updated 2 years ago
- ☆33Updated 3 weeks ago
- ☆62Updated last year
- A Topology-Aware Custom Scheduler For Kubernetes☆66Updated 2 years ago
- Incubating P/D sidecar for llm-d☆16Updated 2 weeks ago