NVIDIA / k8s-nim-operatorLinks

An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.

☆119

Alternatives and similar repositories for k8s-nim-operator

Users that are interested in k8s-nim-operator are comparing it to the libraries listed below

Sorting:

kubernetes-sigs / inference-perf
GenAI inference performance benchmarking tool
☆71Updated this week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆246Updated this week
NVIDIA / topograph
A toolkit for discovering cluster network topology.
☆59Updated last week
NVIDIA / k8s-dra-driver-gpu
NVIDIA DRA Driver for GPUs
☆400Updated last week
kubernetes-sigs / dra-example-driver
Example DRA driver that developers can fork and modify to get them started writing their own.
☆85Updated this week
NVIDIA / nvkind
☆162Updated last week
kubernetes-sigs / wg-serving
WG Serving
☆28Updated last week
NVIDIA / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆723Updated this week
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆415Updated this week
cncf-tags / container-device-interface
☆254Updated last month
NVIDIA / cloud-native-stack
Run cloud native workloads on NVIDIA GPUs
☆188Updated this week
intel / platform-aware-scheduling
Enabling Kubernetes to make pod placement decisions with platform intelligence.
☆176Updated 6 months ago
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆69Updated 2 weeks ago
project-codeflare / multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
☆117Updated last year
kubeflow / model-registry
Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…
☆138Updated last week
run-ai / fake-gpu-operator
☆130Updated 2 weeks ago
cncf / tag-runtime
🏃🏿‍♀️🏃🏽‍♀️🏃🏻‍♂️🕒CNCF Technical Advisory Group for Runtime
☆95Updated 3 months ago
cnvrg / metagpu
K8s device plugin for GPU sharing
☆98Updated 2 years ago
project-codeflare / instaslice
InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing
☆29Updated 8 months ago
openshift / instaslice-operator
InstaSlice Operator facilitates slicing of accelerators using stable APIs
☆41Updated this week
NVIDIA / ais-k8s
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
☆106Updated last week
kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆526Updated this week
NVIDIA / mig-parted
MIG Partition Editor for NVIDIA GPUs
☆204Updated last week
opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆69Updated 2 weeks ago
SlinkyProject / slurm-operator
Run Slurm on Kubernetes. A Slinky project.
☆138Updated 2 weeks ago
kubernetes / dynamic-resource-allocation
☆37Updated this week
llm-d / llm-d-deployer
Helm charts for llm-d
☆51Updated last week
llmnetes / llmnetes
☆59Updated last year
llm-d / llm-d-inference-scheduler
Inference scheduler for llm-d
☆68Updated this week
NVIDIA / gpu-driver-container
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
☆121Updated last week