Cray-HPE / docs-csmLinks
Cray-HPE System Management Documentation for Shasta, High-Performance-Computing-as-a-Service (HPCaaS).
☆31Updated this week
Alternatives and similar repositories for docs-csm
Users that are interested in docs-csm are comparing it to the libraries listed below
Sorting:
- ☆50Updated last month
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆21Updated 3 years ago
- OCI-compatible engine to deploy Linux containers on HPC environments.☆141Updated last year
- A Slurm-based HPC workload management environment, driven by Ansible.☆67Updated last week
- Intel HPC Containers using Singularity☆19Updated 2 years ago
- Spectrum Scale Installation and Configuration☆77Updated this week
- Fluxion Graph-based Scheduler☆101Updated 2 weeks ago
- Slurm Simulator: Slurm Modification to Enable its Simulation☆37Updated last year
- Slurm spank plugin to give each job private /tmp (and/or other dirs)☆22Updated last month
- Bridge operator repo☆21Updated 2 months ago
- Prometheus exporter for a Infiniband Fabric☆68Updated last year
- Data Accelerator: Creates a burst buffer from generic hardware and integrates it with Slurm https://www.hpc.cam.ac.uk/research/data-acc h…☆18Updated 2 years ago
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆14Updated 3 weeks ago
- Ansible role for installing and managing the Slurm Workload Manager☆111Updated this week
- Integrations between commercial and open source applications and LSF published by IBM and others.☆17Updated last year
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆39Updated 3 weeks ago
- HPCPerfStats (formerly TACC Stats) is an automated resource-usage monitoring and analysis package for HPC Clusters.☆52Updated 3 weeks ago
- Lustre Monitoring System☆26Updated 8 months ago
- A collection of diamond collectors for slurm.☆17Updated 2 years ago
- Testing if I can implement slurm in an operator☆15Updated last year
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use.☆36Updated last month
- Lustre Monitoring Tools☆76Updated last month
- Prometheus exporter for use with the Lustre parallel filesystem☆28Updated 3 weeks ago
- A distributed storage benchmark for file systems, object stores & block devices with support for GPUs☆236Updated 2 weeks ago
- Terraform examples for deploying HPC clusters on OCI☆60Updated last month
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆284Updated this week
- ☆28Updated last year
- Lustre Monitoring System based on Collectd, Grafana and Influxdb☆45Updated last year
- Slurm Exporter for Prometheus☆18Updated last year
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated 4 months ago