NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆92Updated this week
Alternatives and similar repositories for k8s-nim-operator:
Users that are interested in k8s-nim-operator are comparing it to the libraries listed below
- GenAI inference performance benchmarking tool☆36Updated 2 weeks ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆69Updated 3 weeks ago
- Holistic job manager on Kubernetes☆115Updated last year
- This project provides a framework that runs Slurm in Kubernetes.☆75Updated 2 weeks ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆33Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆64Updated 3 weeks ago
- ☆100Updated 3 weeks ago
- ☆126Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆92Updated last week
- A toolkit for discovering cluster network topology.☆45Updated last week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆27Updated 4 months ago
- MIG Partition Editor for NVIDIA GPUs☆193Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆218Updated this week
- Gateway API Inference Extension☆229Updated this week
- ☆85Updated 7 months ago
- ☆35Updated this week
- Run cloud native workloads on NVIDIA GPUs☆168Updated last week
- ☆38Updated 2 weeks ago
- K8s device plugin for GPU sharing☆100Updated last year
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆341Updated last week
- WG Serving☆23Updated this week
- Containerization and cloud native suite for OPEA☆50Updated last week
- ☆247Updated last week
- ☆51Updated last year
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆99Updated this week
- This repo includes everything you need to know about deploying GPU nodes on OCI☆26Updated last week
- A Slurm cluster for Kubernetes☆55Updated 8 months ago
- Smart Kubernetes Scheduling☆78Updated last week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆174Updated 2 months ago
- Slurm in Kubernetes☆41Updated 4 months ago