Cray-HPE / docs-csm
Cray-HPE System Management Documentation for Shasta, High-Performance-Computing-as-a-Service (HPCaaS).
☆29Updated this week
Alternatives and similar repositories for docs-csm:
Users who are interested in docs-csm are comparing it to the repositories listed below:
- A Slurm-based HPC workload management environment, driven by Ansible.☆58Updated this week
- Spectrum Scale Installation and Configuration☆71Updated 2 weeks ago
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆33Updated last week
- Ansible roles for the Performance Co-Pilot toolkit☆20Updated 2 months ago
- Confluent Cluster Management software☆32Updated this week
- OCI-compatible engine to deploy Linux containers on HPC environments.☆136Updated 5 months ago
- ☆47Updated this week
- A collection of diamond collectors for slurm.☆15Updated 2 years ago
- OpenShift Migration Controller☆22Updated 2 weeks ago
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆33Updated this week
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use.☆32Updated last week
- Cloud Resource Provisioning framework for IBM Storage Scale (or gpfs)☆30Updated last week
- Bare Metal Provisioning system for HPC Linux clusters☆60Updated this week
- Prometheus exporter for use with the Lustre parallel filesystem☆39Updated 2 years ago
- Prometheus exporter for use with the Lustre parallel filesystem☆22Updated 5 months ago
- Slurm spank plugin to give each job private /tmp (and/or other dirs)☆22Updated 2 years ago
- Deploy a Flux MiniCluster to Kubernetes with the operator☆32Updated last month
- ☆43Updated 3 weeks ago
- Monitoring and visualization of InfiniBand Fabrics☆21Updated 4 years ago
- Prometheus exporter for an InfiniBand Fabric☆59Updated last year
- Kerberos credential support for batch environments☆15Updated 8 months ago
- Lustre Monitoring Tools☆72Updated 5 months ago
- Slurm in Kubernetes☆41Updated 4 months ago
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆242Updated this week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisons - from StackHPC Ltd.☆20Updated 2 years ago
- ☆27Updated 11 months ago
- Performance Benchmarking scripts for Gluster☆31Updated 6 years ago
- ☆83Updated this week
- Prometheus exporter for slurm job/node data☆37Updated 8 months ago
- Testing if I can implement slurm in an operator☆14Updated 5 months ago