ubccr / grendel
Bare Metal Provisioning system for HPC Linux clusters
☆60 Updated last week
Alternatives and similar repositories for grendel:
Users interested in grendel are comparing it to the repositories listed below.
- A Slurm-based HPC workload management environment, driven by Ansible. ☆59 Updated this week
- Slurm Lua SPANK plugin ☆16 Updated 3 months ago
- A daemon that uses cgroups to monitor and manage user behavior on login nodes ☆68 Updated 9 months ago
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs. ☆33 Updated last month
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use. ☆32 Updated last month
- Monitoring and visualization of InfiniBand Fabrics ☆21 Updated 4 years ago
- Ansible role for OpenHPC ☆50 Updated this week
- InfiniBand fabric monitoring daemon written in Go ☆31 Updated last year
- A collection of diamond collectors for slurm. ☆16 Updated 2 years ago
- A coherent Ansible roles collection to simply deploy clusters of nodes. ☆130 Updated this week
- Set of SLURM spank plugins used at LLNL ☆26 Updated 4 years ago
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches ☆61 Updated last month
- OCI-compatible engine to deploy Linux containers on HPC environments. ☆138 Updated 6 months ago
- Prometheus exporter for use with the Lustre parallel filesystem ☆22 Updated 6 months ago
- Warewulf is a scalable systems management suite originally developed to manage large high-performance Linux clusters. ☆107 Updated last year
- Info on CHPC Open OnDemand installation and customization ☆15 Updated 4 months ago
- Converts an Infiniband topology file to graphviz dot format or slurm topology.conf format ☆13 Updated 3 months ago
- Run VMs on an HPC cluster ☆49 Updated last year
- ☆16 Updated 2 years ago
- ☆13 Updated 3 years ago
- Prometheus exporter for use with the Lustre parallel filesystem ☆40 Updated 2 years ago
- ☆44 Updated last month
- TrinityX is the new generation of ClusterVision's open-source HPC, A/I and cloudbursting platform. It is designed from the ground up to p… ☆85 Updated last month
- Lustre Monitoring System based on Collectd, Grafana and Influxdb ☆45 Updated last year
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisons - from StackHPC Ltd. ☆20 Updated 2 years ago
- Export select slurm metrics to prometheus ☆51 Updated last month
- Confluent Cluster Management software ☆32 Updated this week
- GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations. ☆44 Updated 7 months ago
- Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems. ☆44 Updated this week
- Proxy SSH connections on a gateway ☆106 Updated 2 weeks ago