converged-computing / slurm-operator
Testing if I can implement slurm in an operator
☆11Updated this week
Related projects ⓘ
Alternatives and complementary repositories for slurm-operator
- Deploy a Flux MiniCluster to Kubernetes with the operator☆31Updated this week
- ☆35Updated 2 weeks ago
- CSI driver for CernVM-FS☆19Updated last month
- Create beegfs server and client☆23Updated 2 years ago
- OCI-compatible engine to deploy Linux containers on HPC environments.☆129Updated last week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆17Updated 2 years ago
- ☆33Updated 2 weeks ago
- A terminal based monitoring tool for InfiniBand networks using Detector (https://github.com/hhu-bsinfo/detector)☆11Updated 5 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆28Updated this week
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆26Updated 3 months ago
- Software layer of the EESSI project☆24Updated last week
- A quick way of spawning many batch jobs☆13Updated 2 years ago
- InfiniBand fabric monitoring daemon written in Go☆29Updated 7 months ago
- ☆26Updated 4 months ago
- Bare Metal Provisioning system for HPC Linux clusters☆57Updated this week
- A Slurm-based HPC workload management environment, driven by Ansible.☆51Updated this week
- YAML-based database of datacenter infrastructures☆14Updated 2 months ago
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆27Updated 2 months ago
- Slurm Simulator: Slurm Modification to Enable its Simulation☆29Updated 9 months ago
- Ansible playbook for OpenHPC☆24Updated 5 years ago
- Prometheus exporter for use with the Lustre parallel filesystem☆18Updated 2 weeks ago
- GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.☆42Updated last month
- Container-based Slurm cluster with support for running on multiple ssh-accessible computers. Currently it is based on podman, systemd, no…☆20Updated 3 years ago
- Slurm SPANK plugin to let users change GPU compute mode in jobs☆13Updated last year
- Slurm Lua SPANK plugin☆16Updated 2 years ago
- ☆42Updated this week
- TUI for browsing, canceling, and inspecting SLURM jobs☆10Updated 11 months ago
- Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems.☆44Updated this week
- Prometheus exporter for a Infiniband Fabric☆54Updated 10 months ago
- Kerberos credential support for batch environments☆12Updated 3 months ago