NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆82Updated this week
Alternatives and similar repositories for k8s-nim-operator:
Users that are interested in k8s-nim-operator are comparing it to the libraries listed below
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆231Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆57Updated 3 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆186Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆312Updated this week
- K8s device plugin for GPU sharing☆99Updated last year
- GenAI inference performance benchmarking tool☆16Updated this week
- ☆33Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆61Updated 3 weeks ago
- ☆19Updated last month
- ☆91Updated last month
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆89Updated this week
- Gateway API Inference Extension☆146Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆27Updated 2 months ago
- Holistic job manager on Kubernetes☆111Updated 11 months ago
- ☆49Updated 11 months ago
- ☆101Updated this week
- ☆104Updated this week
- Run cloud native workloads on NVIDIA GPUs☆156Updated last week
- A Topology-Aware Custom Scheduler For Kubernetes☆63Updated last year
- ☆85Updated 5 months ago
- Slurm in Kubernetes☆41Updated 2 months ago
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆91Updated this week
- NVIDIA Network Operator☆230Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆174Updated 2 weeks ago
- MIG Partition Editor for NVIDIA GPUs☆185Updated this week
- This repo includes everything you need to know about deploying GPU nodes on OCI☆24Updated last week
- ☆237Updated last week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆66Updated this week
- Smart Kubernetes Scheduling☆73Updated this week
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆95Updated this week