Cray-HPE / docs-csm
Cray-HPE System Management Documentation for Shasta, High-Performance-Computing-as-a-Service (HPCaaS).
☆29Updated this week
Alternatives and similar repositories for docs-csm:
Users that are interested in docs-csm are comparing it to the libraries listed below
- Spectrum Scale Installation and Configuration☆70Updated last week
- Confluent Cluster Management software☆32Updated last week
- Scripts to automate development/test setup for openshift integration with https://github.com/metal3-io/☆96Updated this week
- Cloud Resource Provisioning framework for IBM Storage Scale (or gpfs)☆29Updated last week
- Ansible roles for the Performance Co-Pilot toolkit☆20Updated last month
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆30Updated last month
- ☆27Updated 10 months ago
- Monitoring and visualization of InfiniBand Fabrics☆21Updated 3 years ago
- Slurm spank plugin to give each job private /tmp (and/or other dirs)☆22Updated 2 years ago
- OpenShift Migration Controller☆22Updated last week
- Kerberos credential support for batch environments☆14Updated 8 months ago
- A Slurm-based HPC workload management environment, driven by Ansible.☆55Updated last week
- Dynamic Registry Proxy☆15Updated 2 years ago
- The LVM Operator deploys and manages LVM storage on OpenShift clusters☆49Updated this week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆20Updated 2 years ago
- The meat and potatoes behind farosctl☆13Updated 2 years ago
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use.☆31Updated 3 months ago
- Sherlock - a set of script to assess database performance on OCP/k8s☆31Updated 9 months ago
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches☆58Updated 3 months ago
- VASTPY is the official Python SDK for the VAST Management System☆12Updated 3 weeks ago
- Slurm in Kubernetes☆42Updated 3 months ago
- The operator manages the ovn-kube components running on the DPU card for enabling OVS hardware offloading.☆27Updated 4 months ago
- Create and manage cluster networking configuration☆99Updated last week
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆239Updated this week
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆32Updated last week
- Deploy a Flux MiniCluster to Kubernetes with the operator☆31Updated 3 weeks ago
- Lustre Monitoring System☆23Updated 3 weeks ago
- ☆45Updated last week
- ☆82Updated this week
- A collection of diamond collectors for slurm.☆15Updated last year