ULHPC / puppet-slurm
A Puppet module designed to configure and manage SLURM (see https://slurm.schedmd.com/), an open-source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters.
☆19 Updated 2 months ago
Alternatives and similar repositories for puppet-slurm:
Users who are interested in puppet-slurm are comparing it to the repositories listed below.
- Prometheus exporter for slurm job/node data ☆37 Updated 7 months ago
- A Slurm-based HPC workload management environment, driven by Ansible. ☆56 Updated this week
- Custom Slurm tools ☆25 Updated 6 years ago
- SLURM Bank, a collection of wrapper scripts to give slurm GOLD like capabilities for managing resources. ☆24 Updated 6 years ago
- Bare Metal Provisioning system for HPC Linux clusters ☆60 Updated last week
- Create beegfs server and client ☆24 Updated 3 years ago
- A coherent Ansible roles collection to simply deploy clusters of nodes. ☆127 Updated this week
- InfiniBand fabric monitoring daemon written in Go ☆30 Updated last year
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches ☆58 Updated 3 months ago
- ☆27 Updated 10 months ago
- Dynamic Registry Proxy ☆15 Updated 2 years ago
- Puppet module for SLURM client and server ☆15 Updated 3 years ago
- Ansible playbook for OpenHPC ☆24 Updated 5 years ago
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs. ☆31 Updated last week
- Monitoring and visualization of InfiniBand Fabrics ☆21 Updated 3 years ago
- Ansible role for OpenHPC ☆49 Updated last month
- OCI-compatible engine to deploy Linux containers on HPC environments. ☆135 Updated 5 months ago
- Cray Lustre is HPE's curated Lustre distro for Cray EX and other Cray ClusterStor clients ☆16 Updated this week
- Python module for hardware detection and classification ☆53 Updated 4 months ago
- A quick and dirty rest interface to the slurm api and commands. ☆11 Updated 7 years ago
- SLURM jobcomp plugin to index data into an elasticsearch server ☆14 Updated 8 years ago
- Fluxion Graph-based Scheduler ☆94 Updated last week
- Collection of the Finnish Grid and Cloud Infrastructure Ansible playbooks ☆54 Updated last year
- A daemon that uses cgroups to monitor and manage user behavior on login nodes ☆66 Updated 8 months ago
- Prometheus exporter for performance metrics from Slurm. ☆251 Updated 9 months ago
- ☆45 Updated last week
- A terminal based monitoring tool for InfiniBand networks using Detector (https://github.com/hhu-bsinfo/detector) ☆13 Updated 5 years ago
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisons - from StackHPC Ltd. ☆20 Updated 2 years ago
- A distributed storage benchmark for file systems, object stores & block devices with support for GPUs ☆191 Updated 2 weeks ago
- Supercomputing. Seamlessly. Open, Interactive HPC Via the Web ☆93 Updated 3 years ago