kubernetes-sigs / wg-serving
WG Serving
☆23Updated last month
Alternatives and similar repositories for wg-serving:
Users who are interested in wg-serving are comparing it to the repositories listed below:
- GenAI inference performance benchmarking tool☆36Updated 2 weeks ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆69Updated 3 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆64Updated 3 weeks ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆33Updated this week
- Smart Kubernetes Scheduling☆78Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆92Updated this week
- ☆35Updated this week
- ☆51Updated last year
- K8s device plugin for GPU sharing☆100Updated last year
- ☆125Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆218Updated this week
- A CRD for arbitrary properties about a cluster☆34Updated 3 weeks ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆72Updated 2 weeks ago
- Kubernetes Work API☆66Updated 3 weeks ago
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆93Updated 3 months ago
- Operator for Multi-Cluster Monitoring with Thanos.☆132Updated this week
- Cloud Native Artificial Intelligence Model Format Specification☆41Updated this week
- Operator for managing Node Feature Discovery deployment☆69Updated 3 weeks ago
- Gateway API Inference Extension☆229Updated this week
- Minimum cluster registration and work☆58Updated 6 months ago
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆99Updated this week
- ErieCanal is an MCS (multi-cluster service, https://github.com/kubernetes-sigs/mcs-api) implementation that provides MCS, Ingress, Egress, Gatew…☆50Updated last year
- CNI DRA Driver☆24Updated 3 months ago
- eXtThrottle - Extended resource quota controller and admission webhook for using GPUs in Kubernetes.☆17Updated 7 years ago
- 🎉 An awesome & curated list of best LLMOps tools.☆83Updated last week
- Last Week in Kubernetes Development☆149Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆120Updated last week
- CoreDNS plugin implementing K8s multi-cluster services DNS spec.☆49Updated 3 weeks ago
- hub / spoke registration controllers☆42Updated 5 months ago
- ☆114Updated 2 years ago