NVIDIA / gpu-operator
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
☆2,108Updated this week
Alternatives and similar repositories for gpu-operator:
Users that are interested in gpu-operator are comparing it to the libraries listed below
- NVIDIA device plugin for Kubernetes☆3,196Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,174Updated 3 weeks ago
- Tools for monitoring NVIDIA GPUs on Linux☆1,036Updated 3 years ago
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆352Updated last week
- GPU plugin to the node feature discovery for Kubernetes☆300Updated 11 months ago
- Node feature discovery for Kubernetes☆866Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,463Updated last year
- Kubernetes-native Job Queueing☆1,764Updated this week
- Tools for building GPU clusters☆1,341Updated 3 weeks ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆475Updated 3 weeks ago
- Kubeflow Deployment Manifests☆912Updated this week
- GPU Sharing Device Plugin for Kubernetes Cluster☆480Updated 2 years ago
- This driver allows Kubernetes to access NFS server on Linux node.☆1,001Updated last week
- CSI driver for Ceph☆1,380Updated this week
- Kubernetes Cluster Federation☆2,498Updated 2 years ago
- A CNI meta-plugin for multi-homed pods in Kubernetes☆2,570Updated 2 weeks ago
- Dynamically provisioning persistent local storage with Kubernetes☆2,463Updated 3 weeks ago
- This is a place for various problem detectors running on the Kubernetes nodes.☆3,136Updated last week
- A management framework for extending Kubernetes with Operators☆1,769Updated this week
- Elastic Cloud on Kubernetes☆2,707Updated last week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,091Updated last year
- ☆875Updated last year
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆323Updated this week
- Home for Cluster API, a subproject of sig-cluster-lifecycle☆3,786Updated this week
- CLI and validation tools for Kubelet Container Runtime Interface (CRI) .☆1,804Updated last week
- Simple Kubernetes Operator for MinIO clusters☆1,294Updated this week
- Repository for the next iteration of composite service (e.g. Ingress) and load balancing APIs.☆2,056Updated last week
- NVIDIA container runtime library☆946Updated 2 weeks ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,195Updated last week
- A Cloud Native Batch System (Project under CNCF)☆4,627Updated this week