ULHPC / puppet-slurm
A Puppet module designed to configure and manage SLURM (see https://slurm.schedmd.com/), an open-source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters.
☆18Updated 4 months ago
Alternatives and similar repositories for puppet-slurm:
Users who are interested in puppet-slurm are comparing it to the repositories listed below.
- A Slurm-based HPC workload management environment, driven by Ansible.☆58Updated this week
- Prometheus exporter for slurm job/node data☆38Updated this week
- Dynamic Registry Proxy☆15Updated 2 years ago
- SLURM Bank, a collection of wrapper scripts that give Slurm GOLD-like capabilities for managing resources.☆24Updated 6 years ago
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆33Updated 3 weeks ago
- Custom Slurm tools☆25Updated 6 years ago
- Create beegfs server and client☆24Updated 3 years ago
- Cluster stack based on Salt☆18Updated 5 years ago
- A terminal based monitoring tool for InfiniBand networks using Detector (https://github.com/hhu-bsinfo/detector)☆14Updated 5 years ago
- Ansible role for OpenHPC☆50Updated this week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisons - from StackHPC Ltd.☆20Updated 2 years ago
- Fluxion Graph-based Scheduler☆95Updated last week
- Bare Metal Provisioning system for HPC Linux clusters☆60Updated last week
- Ansible playbook for OpenHPC☆25Updated 5 years ago
- InfiniBand fabric monitoring daemon written in Go☆31Updated last year
- Integrated Manager for Lustre☆75Updated 4 years ago
- HPCPerfStats (formerly TACC Stats) is an automated resource-usage monitoring and analysis package.☆46Updated this week
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use.☆32Updated last month
- Slurm job script archival☆12Updated last month
- Slurm Lua SPANK plugin☆16Updated 3 months ago
- A distributed storage benchmark for file systems, object stores & block devices with support for GPUs☆201Updated this week
- Prometheus exporter for lustre☆17Updated last week
- Monitoring and visualization of InfiniBand Fabrics☆21Updated 4 years ago
- Testing if I can implement slurm in an operator☆14Updated 6 months ago
- Warewulf is a scalable systems management suite originally developed to manage large high-performance Linux clusters.☆107Updated last year
- REMORA: REsource MOnitoring for Remote Applications☆59Updated last week
- Slurm SPANK plugin to let users change GPU compute mode in jobs☆12Updated 2 years ago
- Generate graphviz dot files from InfiniBand topology dumps.☆16Updated last year
- Command-line tool to retrieve information and monitor Mellanox unmanaged InfiniBand switches☆61Updated last month