lopentusska / slurm_ubuntu_gpu_cluster
Instructions for setting up a Slurm gpu cluster on Ubuntu 22.04.
☆25 Updated last year
Alternatives and similar repositories for slurm_ubuntu_gpu_cluster
Users who are interested in slurm_ubuntu_gpu_cluster are comparing it to the repositories listed below
- Instructions for setting up a SLURM cluster using Ubuntu 18.04.3 with GPUs. ☆151 Updated 4 years ago
- Container plugin for Slurm Workload Manager ☆344 Updated 7 months ago
- Open source web interface for Slurm HPC & AI clusters ☆431 Updated this week
- NVIDIA NCCL Tests for Distributed Training ☆92 Updated last week
- A Slurm cluster using docker-compose ☆371 Updated 8 months ago
- Jobstats is a job monitoring platform for CPU and GPU clusters ☆74 Updated last month
- Ansible role for installing and managing the Slurm Workload Manager ☆102 Updated last month
- Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images ☆130 Updated 5 years ago
- My tools for the Slurm HPC workload manager ☆508 Updated last week
- Prometheus exporter for performance metrics from Slurm. ☆252 Updated 11 months ago
- MIG Partition Editor for NVIDIA GPUs ☆200 Updated this week
- core services for the Flux resource management framework ☆186 Updated this week
- Tutorial for installing Open XDMoD, OnDemand, & ColdFront ☆145 Updated 2 months ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs ☆520 Updated last month
- ☆98 Updated 8 months ago
- Slurm-Mail is a drop in replacement for Slurm's e-mails to give users much more information about their jobs compared to the standard Slu… ☆108 Updated this week
- Export select slurm metrics to prometheus ☆52 Updated 2 months ago
- Steps to create a small slurm cluster with GPU enabled nodes ☆270 Updated 2 years ago
- You should offer both Podman and Apptainer with name spaces on your HPC systems ☆60 Updated last year
- A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04LTS using Slurm and Munge. Created by the Quant Club @ UIowa. ☆317 Updated last year
- Tools for building GPU clusters ☆1,360 Updated last month
- Ansible role for OpenHPC ☆50 Updated last week
- A tool for bandwidth measurements on NVIDIA GPUs. ☆449 Updated last month
- NCCL Tests ☆1,127 Updated this week
- slurm-docker-integration provides HPC-Kubernetes integration artifacts ☆25 Updated last year
- KvikIO - High Performance File IO ☆210 Updated this week
- Supercomputing. Seamlessly. Open, Interactive HPC Via the Web ☆339 Updated this week
- SLURM Tools and UBiLities ☆70 Updated 2 years ago
- OCI-compatible engine to deploy Linux containers on HPC environments. ☆139 Updated 7 months ago
- Fluxion Graph-based Scheduler ☆97 Updated 3 weeks ago