SlinkyProject/slurm-operator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SlinkyProject/slurm-operator)

SlinkyProject / slurm-operator

Run Slurm on Kubernetes. A Slinky project.

☆237Updated this week

Alternatives and similar repositories for slurm-operator

Users that are interested in slurm-operator are comparing it to the libraries listed below

Sorting:

SlinkyProject / slurm-bridge
View on GitHub
Run Slurm as a Kubernetes scheduler. A Slinky project.
☆66Updated this week
SlinkyProject / slurm-client
View on GitHub
OpenAPI Golang client library for Slurm REST API. A Slinky project.
☆22Updated this week
nebius / soperator
View on GitHub
Run Slurm in Kubernetes
☆358Updated this week
SlinkyProject / slurm-exporter
View on GitHub
Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.
☆16Nov 7, 2025Updated 3 months ago
stackhpc / slurm-k8s-cluster
View on GitHub
A Slurm cluster for Kubernetes
☆68Jul 26, 2024Updated last year
DDNStorage / exa-csi-driver
View on GitHub
☆33Updated this week
flux-framework / flux-operator
View on GitHub
Deploy a Flux MiniCluster to Kubernetes with the operator
☆40Jan 9, 2026Updated last month
NVIDIA / topograph
View on GitHub
A toolkit for discovering cluster network topology.
☆99Feb 19, 2026Updated last week
singularityhub / singularity-compose-examples
View on GitHub
A simple example of running a MongoDB instance to query a database
☆10Aug 31, 2022Updated 3 years ago
vultr / slik
View on GitHub
Slurm in Kubernetes
☆43Nov 20, 2025Updated 3 months ago
argonne-lcf / copper
View on GitHub
scalable data movement in Exascale Supercomputers
☆17Updated this week
NERSC / podman-hpc
View on GitHub
☆57Dec 12, 2025Updated 2 months ago
kubernetes-sigs / kjob
View on GitHub
KJob: Tool for CLI-loving ML researchers
☆41Dec 29, 2025Updated 2 months ago
NVIDIA / dgxc-benchmarking
View on GitHub
DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft…
☆64Updated this week
NVIDIA / k8s-dra-driver-gpu
View on GitHub
NVIDIA DRA Driver for GPUs
☆574Updated this week
NVIDIA / KAI-Scheduler
View on GitHub
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆1,144Updated this week
coreweave / nccl-tests
View on GitHub
NVIDIA NCCL Tests for Distributed Training
☆137Updated this week
GoogleCloudPlatform / cluster-health-scanner
View on GitHub
☆32Oct 31, 2025Updated 4 months ago
Mellanox / k8s-rdma-shared-dev-plugin
View on GitHub
☆336Feb 22, 2026Updated last week
kubernetes-sigs / wg-ai-conformance
View on GitHub
Proposals and discussions for the AI Conformance Working Group.
☆19Dec 17, 2025Updated 2 months ago
nauhpc / jobstats
View on GitHub
Scripts for viewing Slurm batch job resource usages
☆11Jan 3, 2022Updated 4 years ago
HPCToolkit / hpctoolkit-tutorial-examples
View on GitHub
CPU and GPU tutorial examples
☆13Apr 4, 2025Updated 10 months ago
foundation-model-stack / fms-acceleration
View on GitHub
🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
☆13Jan 30, 2026Updated last month
boweiliu / nccl
View on GitHub
Optimized primitives for collective multi-GPU communication
☆10May 8, 2024Updated last year
boz / kcache
View on GitHub
kubernetes object cache
☆13Mar 6, 2023Updated 2 years ago
Mellanox / network-operator
View on GitHub
NVIDIA Network Operator
☆325Updated this week
grafana / pentagon
View on GitHub
Vault <-> Kubernetes Secrets
☆12Jan 25, 2022Updated 4 years ago
metaspace2020 / Lithops-METASPACE
View on GitHub
Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline
☆12Jul 6, 2023Updated 2 years ago
r-ccs-cms / sbd
View on GitHub
☆20Feb 5, 2026Updated 3 weeks ago
NVIDIA / k8s-nim-operator
View on GitHub
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆150Updated this week
ubccr / xdmod-supremm
View on GitHub
The Job Performance (SUPReMM) module for Open XDMoD.
☆11Jan 30, 2026Updated last month
llm-d-incubation / llm-d-modelservice
View on GitHub
helm charts for deploying models with llm-d
☆28Updated this week
aws-samples / ec2-topology-aware-for-slurm
View on GitHub
☆12May 30, 2025Updated 9 months ago
openshift / ib-sriov-cni
View on GitHub
InfiniBand SR-IOV CNI
☆13Feb 13, 2026Updated 2 weeks ago
kubeflow / sdk
View on GitHub
Universal Python SDK to run AI workloads on Kubernetes
☆80Updated this week
kubewharf / enhanced-k8s
View on GitHub
This repo tracks all enhanced patches to the KuberWharf Kubernetes
☆31Oct 25, 2024Updated last year
kubernetes-sigs / dra-example-driver
View on GitHub
Example DRA driver that developers can fork and modify to get them started writing their own.
☆120Updated this week
skylt / rabbitmq-operator
View on GitHub
Rabbitmq operator for kubernetes
☆13Jun 8, 2020Updated 5 years ago
lcrownover / prometheus-slurm-exporter
View on GitHub
Prometheus exporter for the SLURM scheduler
☆19Oct 24, 2025Updated 4 months ago