NVIDIA / ansible-role-nvidia-docker
☆35 · Updated last year
Related projects
Alternatives and complementary repositories for ansible-role-nvidia-docker
- ☆117 · Updated 3 months ago
- Singularity implementation of k8s operator for interacting with SLURM. ☆117 · Updated 3 years ago
- Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images ☆120 · Updated 5 years ago
- Scheduling GPU cluster workloads with Slurm ☆74 · Updated 6 years ago
- The System Stacks for Linux* OS are a collection of production ready docker images for Deep Learning, Media and Storage optimized for 2nd… ☆33 · Updated last year
- The Singularity implementation of the Kubernetes Container Runtime Interface ☆114 · Updated 3 years ago
- Tools to deploy GPU clusters in the Cloud ☆30 · Updated last year
- Run cloud native workloads on NVIDIA GPUs ☆134 · Updated this week
- Prometheus Exporter for NVIDIA GPUs using NVML ☆73 · Updated 4 years ago
- Ansible role for installing and managing the Slurm Workload Manager ☆88 · Updated 7 months ago
- A top-like tool for monitoring GPUs in a cluster ☆81 · Updated 9 months ago
- Container plugin for Slurm Workload Manager ☆294 · Updated 2 weeks ago
- NGC Container Replicator ☆28 · Updated last year
- Spawn JupyterHub single-user servers with ssh ☆24 · Updated last year
- MIG Partition Editor for NVIDIA GPUs ☆174 · Updated this week
- Prometheus exporter for performance metrics from Slurm. ☆236 · Updated 5 months ago
- server for storage and management of singularity images ☆103 · Updated 4 months ago
- files and instructions for creating and using example containers from the sylabs.io blog ☆103 · Updated last year
- OCI-compatible engine to deploy Linux containers on HPC environments. ☆129 · Updated 3 weeks ago
- slurm-docker-integration provides HPC-Kubernetes integration artifacts ☆23 · Updated 7 months ago
- Prometheus GPU Metrics Exporter ☆18 · Updated 6 years ago
- GPU plugin to the node feature discovery for Kubernetes ☆293 · Updated 5 months ago
- Experiments API for Experiment Tracking on Kubernetes ☆27 · Updated last year
- A Slurm-based HPC workload management environment, driven by Ansible. ☆51 · Updated this week
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times ☆88 · Updated 2 years ago
- Containerized testing of system components that impact AI workload performance ☆14 · Updated last year
- How to Configure a GPU Cluster Running Ubuntu Linux ☆54 · Updated 7 years ago
- Documentation repository for NVIDIA Cloud Native Technologies ☆17 · Updated this week
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster ☆274 · Updated this week