clusterinthecloud / ansible
Ansible config for Cluster in the Cloud
☆10 Updated 8 months ago
Alternatives and similar repositories for ansible:
Users who are interested in ansible are comparing it to the repositories listed below
- A Slurm-based HPC workload management environment, driven by Ansible. ☆52 Updated this week
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use. ☆29 Updated last month
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs. ☆27 Updated 5 months ago
- Ansible role for OpenHPC ☆47 Updated last month
- User Fencing Tools ☆16 Updated 2 years ago
- A coherent Ansible roles collection to simply deploy clusters of nodes. ☆120 Updated 2 weeks ago
- ☆13 Updated 3 years ago
- Ansible playbook for OpenHPC ☆24 Updated 5 years ago
- A quick way of spawning many batch jobs ☆14 Updated 2 years ago
- Puppet Environment repo for Magic Castle - https://github.com/ComputeCanada/magic_castle ☆13 Updated this week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisons - from StackHPC Ltd. ☆19 Updated 2 years ago
- ☆40 Updated 2 weeks ago
- Terraform examples for deploying HPC clusters on OCI ☆41 Updated 3 months ago
- Export select slurm metrics to prometheus ☆47 Updated this week
- Terraform config for Cluster in the Cloud ☆20 Updated 9 months ago
- Monitoring and visualization of InfiniBand Fabrics ☆20 Updated 3 years ago
- Miscellaneous plugins for Slurm (http://slurm.schedmd.com/) ☆18 Updated last week
- An inventory tool for xcat cluster ☆8 Updated 8 months ago
- A daemon that uses cgroups to monitor and manage user behavior on login nodes ☆62 Updated 5 months ago
- A collection of diamond collectors for slurm. ☆15 Updated last year
- SLURM job completion log database and query tool ☆9 Updated 8 years ago
- SLURM jobcomp plugin to index data into an elasticsearch server ☆14 Updated 8 years ago
- ☆15 Updated 7 years ago
- ☆35 Updated last month
- Warewulf is a scalable systems management suite originally developed to manage large high-performance Linux clusters. ☆107 Updated 9 months ago
- HPC dashboards developed for SRCC systems ☆18 Updated 3 years ago
- ☆28 Updated 5 years ago
- Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems. ☆44 Updated this week
- An open framework for collecting and analyzing HPC metrics. ☆87 Updated last week
- Ansible modules for HPC clusters ☆15 Updated 5 months ago