NVIDIA / ansible-role-nvidia-driver
☆112 Updated last month
Related projects:
- ☆34 Updated 11 months ago
- MIG Partition Editor for NVIDIA GPUs ☆163 Updated this week
- The Singularity implementation of the Kubernetes Container Runtime Interface ☆114 Updated 3 years ago
- Kubernetes Operator, Ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆66 Updated last week
- Container plugin for Slurm Workload Manager ☆278 Updated last month
- GPU plugin to the node feature discovery for Kubernetes ☆287 Updated 3 months ago
- Singularity implementation of k8s operator for interacting with SLURM. ☆118 Updated 3 years ago
- Run cloud native workloads on NVIDIA GPUs ☆124 Updated 2 weeks ago
- Tools to deploy GPU clusters in the Cloud ☆30 Updated last year
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers. ☆62 Updated this week
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads. ☆217 Updated last week
- Nvidia-smi Prometheus exporter that respects GPU-UUID ☆32 Updated last year
- NVIDIA GPU Prometheus Exporter ☆222 Updated 3 years ago
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster ☆258 Updated this week
- A Slurm cluster for Kubernetes ☆36 Updated last month
- ☆53 Updated last week
- The NetApp DataOps Toolkit is a Python library that makes it simple for developers, data scientists, DevOps engineers, and data engineers… ☆46 Updated 2 weeks ago
- Ansible role for installing and managing the Slurm Workload Manager ☆84 Updated 5 months ago
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times ☆87 Updated 2 years ago
- Installs CUDA ☆30 Updated 2 years ago
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers. ☆26 Updated 3 weeks ago
- nvidia-smi exporter for Prometheus ☆72 Updated 3 years ago
- Prometheus exporter for the stats in cgroup accounting with Slurm. It also collects stats for jobs using NVIDIA GPUs. ☆25 Updated last month
- ☆43 Updated 3 months ago
- ☆202 Updated 2 weeks ago
- Scheduling GPU cluster workloads with Slurm ☆73 Updated 5 years ago
- The NVIDIA Driver Manager is a Kubernetes component which assists in seamless upgrades of the NVIDIA Driver on each node of the cluster. ☆33 Updated this week
- Prometheus exporter for an Infiniband Fabric ☆52 Updated 9 months ago
- nvidiagpubeat is an elastic beat that uses NVIDIA System Management Interface (nvidia-smi) to monitor NVIDIA GPU devices and can ingest m… ☆54 Updated 3 years ago
- NVIDIA k8s device plugin for Kubevirt ☆222 Updated last month