NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆88 · Updated last week
Alternatives and similar repositories for k8s-nim-operator:
Users interested in k8s-nim-operator are comparing it to the repositories listed below.
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆27 · Updated 3 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆63 · Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆194 · Updated last week
- GenAI inference performance benchmarking tool ☆19 · Updated last week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆330 · Updated this week
- K8s device plugin for GPU sharing ☆100 · Updated last year
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆29 · Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆63 · Updated last week
- Gateway API Inference Extension ☆179 · Updated last week
- Holistic job manager on Kubernetes ☆112 · Updated last year
- A toolkit for discovering cluster network topology. ☆37 · Updated this week
- This project provides a framework that runs Slurm in Kubernetes. ☆64 · Updated 2 weeks ago
- Smart Kubernetes Scheduling ☆76 · Updated this week
- Kubernetes Operator, Ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆89 · Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆339 · Updated this week
- Containerization and cloud native suite for OPEA ☆44 · Updated this week
- This repo includes everything you need to know about deploying GPU nodes on OCI ☆25 · Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence. ☆174 · Updated last month
- MIG Partition Editor for NVIDIA GPUs ☆190 · Updated this week
- Run cloud native workloads on NVIDIA GPUs ☆163 · Updated 3 weeks ago
- WG Serving ☆20 · Updated last month
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆99 · Updated this week