NVIDIA / ansible-role-nvidia-driver
☆117Updated 5 months ago
Alternatives and similar repositories for ansible-role-nvidia-driver:
Users that are interested in ansible-role-nvidia-driver are comparing it to the libraries listed below
- ☆35Updated last year
- MIG Partition Editor for NVIDIA GPUs☆185Updated this week
- NVIDIA GPU Prometheus Exporter☆231Updated 3 years ago
- A Slurm-based HPC workload management environment, driven by Ansible.☆52Updated this week
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆28Updated 5 months ago
- Nvidia-smi Prometheus exporter with respecting of GPU-UUID☆34Updated last year
- Ansible role for installing and managing the Slurm Workload Manager☆90Updated last week
- nvidiagpubeat is an elastic beat that uses NVIDIA System Management Interface (nvidia-smi) to monitor NVIDIA GPU devices and can ingest m…☆53Updated 4 years ago
- Bare Metal Provisioning system for HPC Linux clusters☆58Updated this week
- Prometheus exporter for slurm job/node data☆33Updated 5 months ago
- Export select slurm metrics to prometheus☆47Updated last week
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆297Updated last week
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆231Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆296Updated 8 months ago
- A Slurm cluster for Kubernetes☆50Updated 6 months ago
- Tools to deploy GPU clusters in the Cloud☆30Updated last year
- Packer plugin for Proxmox Builder☆165Updated last week
- Terraform Harvester provider☆74Updated last week
- ☆59Updated 4 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆88Updated last week
- Prometheus exporter for performance metrics from Slurm.☆247Updated 7 months ago
- COSI driver for Ceph Object Store aka RGW☆40Updated 7 months ago
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆85Updated this week
- Prometheus exporter for a Infiniband Fabric☆57Updated last year
- Container plugin for Slurm Workload Manager☆314Updated 2 months ago
- ☆20Updated this week
- The NetApp DataOps Toolkit is a Python library that makes it simple for developers, data scientists, DevOps engineers, and data engineers…☆3Updated 4 months ago
- Slurm in Kubernetes☆40Updated last month
- Ansible collection for installing k3sup a light-weight utility to get from zero to KUBECONFIG with k3s on any local or remote VM☆31Updated 2 years ago
- nvidia-smi exporter for Prometheus☆73Updated 3 years ago