converged-computing / slurm-operatorLinks
Testing if I can implement slurm in an operator
☆15Updated last year
Alternatives and similar repositories for slurm-operator
Users that are interested in slurm-operator are comparing it to the libraries listed below
Sorting:
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆15Updated 2 months ago
- A terminal based monitoring tool for InfiniBand networks using Detector (https://github.com/hhu-bsinfo/detector)☆14Updated 6 years ago
- OCI-compatible engine to deploy Linux containers on HPC environments.☆146Updated last year
- Straw - The simple tool to suck the config out of your Slurm beverage!☆11Updated 3 years ago
- ☆54Updated last month
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆44Updated last month
- The Singularity SPANK plugin provides the users with an interface to launch an application within a Linux container.☆11Updated 2 months ago
- Deploy a Flux MiniCluster to Kubernetes with the operator☆39Updated 2 weeks ago
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆21Updated 3 years ago
- Create beegfs server and client☆24Updated 4 years ago
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆41Updated last month
- Bare Metal Provisioning system for HPC Linux clusters☆67Updated last week
- A Slurm-based HPC workload management environment, driven by Ansible.☆67Updated this week
- ☆13Updated 10 months ago
- ☆50Updated this week
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆21Updated last month
- Node feature discovery, detects the available hardware features and configuration in a cluster.☆17Updated last month
- deploys Lmod☆17Updated 2 weeks ago
- Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems.☆46Updated this week
- Kerberos credential support for batch environments☆21Updated 3 years ago
- Prometheus exporter for a Infiniband Fabric☆69Updated 2 years ago
- Kerberos credential support for batch environments☆16Updated last year
- Ansible roles for the Performance Co-Pilot toolkit☆21Updated last week
- A quick way of spawning many batch jobs☆14Updated 3 years ago
- InfiniBand fabric monitoring daemon written in Go☆32Updated 8 months ago
- Software layer of the EESSI project☆33Updated this week
- Slurm in Kubernetes☆43Updated 2 months ago
- Run Slurm on Kubernetes. A Slinky project.☆218Updated last week
- Slurm Lua SPANK plugin☆16Updated 11 months ago
- HPC dashboards developed for SRCC systems☆19Updated 4 years ago