ULHPC / puppet-slurm
A Puppet module designed to configure and manage SLURM (see https://slurm.schedmd.com/), an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters.
☆19 Updated last month
Alternatives and similar repositories for puppet-slurm:
Users who are interested in puppet-slurm are comparing it to the repositories listed below.
- Bare Metal Provisioning system for HPC Linux clusters ☆59 Updated this week
- Prometheus exporter for slurm job/node data ☆36 Updated 6 months ago
- A Slurm-based HPC workload management environment, driven by Ansible. ☆55 Updated this week
- YAML-based database of datacenter infrastructures ☆16 Updated this week
- SLURM Bank, a collection of wrapper scripts to give slurm GOLD-like capabilities for managing resources. ☆24 Updated 6 years ago
- Spectrum Scale Installation and Configuration ☆69 Updated this week
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs. ☆29 Updated this week
- Ansible role for OpenHPC ☆47 Updated 2 months ago
- Slurm Lua SPANK plugin ☆16 Updated 3 weeks ago
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches ☆56 Updated 2 months ago
- HPCPerfStats is an automated resource-usage monitoring and analysis package. ☆46 Updated this week
- Dynamic Registry Proxy ☆15 Updated last year
- Fluxion Graph-based Scheduler ☆92 Updated last week
- Create beegfs server and client ☆24 Updated 3 years ago
- Warewulf is a scalable systems management suite originally developed to manage large high-performance Linux clusters. ☆106 Updated 10 months ago
- HPC dashboards developed for SRCC systems ☆18 Updated 3 years ago
- ☆27 Updated 9 months ago
- Monitoring and visualization of InfiniBand Fabrics ☆21 Updated 3 years ago
- ☆13 Updated 3 years ago
- Slurm SPANK plugin to let users change GPU compute mode in jobs ☆12 Updated last year
- Confluent Cluster Management software ☆32 Updated this week
- TrinityX is the new generation of ClusterVision's open-source HPC, A/I and cloudbursting platform. It is designed from the ground up to p… ☆77 Updated 2 weeks ago
- Ansible playbook for OpenHPC ☆24 Updated 5 years ago
- Kerberos credential support for batch environments ☆14 Updated 6 months ago
- A coherent Ansible roles collection to simply deploy clusters of nodes. ☆124 Updated this week
- A collection of diamond collectors for slurm. ☆15 Updated last year
- Prometheus exporter for use with the Lustre parallel filesystem ☆22 Updated 3 months ago
- Cray-HPE System Management Documentation for Shasta, High-Performance-Computing-as-a-Service (HPCaaS). ☆29 Updated this week
- Slurm spank plugin to give each job private /tmp (and/or other dirs) ☆22 Updated 2 years ago
- Confluent is a software package to handle essential bootstrap and operation of scale-out server configurations. ☆37 Updated this week