NVIDIA / gpu-operatorLinks
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
☆2,526Updated this week
Alternatives and similar repositories for gpu-operator
Users that are interested in gpu-operator are comparing it to the libraries listed below
Sorting:
- NVIDIA device plugin for Kubernetes☆3,650Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,597Updated last week
- NVIDIA DRA Driver for GPUs☆553Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,111Updated this week
- Kubernetes-native Job Queueing☆2,298Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,527Updated 2 years ago
- Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)☆2,977Updated this week
- Tools for monitoring NVIDIA GPUs on Linux☆1,068Updated 4 years ago
- Node feature discovery for Kubernetes☆993Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆308Updated last year
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆371Updated 2 weeks ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆656Updated last week
- A toolkit to run Ray applications on Kubernetes☆2,305Updated this week
- NVIDIA device plugin for Kubernetes☆49Updated last year
- Kubeflow Deployment Manifests☆989Updated this week
- Simple Kubernetes Operator for MinIO clusters☆1,417Updated last month
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,021Updated this week
- Dynamically provisioning persistent local storage with Kubernetes☆2,764Updated 3 weeks ago
- CSI driver for Ceph☆1,503Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆510Updated 2 weeks ago
- Gateway API Inference Extension☆576Updated this week
- This driver allows Kubernetes to access NFS server on Linux node.☆1,209Updated this week
- Kubernetes CSI driver for Direct Attached Storage☆700Updated last week
- ☆892Updated last year
- CLI and validation tools for Kubelet Container Runtime Interface (CRI) .☆1,946Updated last week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,267Updated 2 months ago
- Tools for building GPU clusters☆1,414Updated 3 weeks ago
- Repository for the next iteration of composite service (e.g. Ingress) and load balancing APIs.☆2,617Updated this week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,139Updated last week
- NVIDIA container runtime library☆1,068Updated last week