Run Slurm on Kubernetes. A Slinky project.
☆237 · Updated this week
Alternatives and similar repositories for slurm-operator
Users who are interested in slurm-operator are comparing it to the libraries listed below.
- Run Slurm as a Kubernetes scheduler. A Slinky project. ☆66 · Updated this week
- OpenAPI Golang client library for Slurm REST API. A Slinky project. ☆22 · Updated this week
- Run Slurm in Kubernetes ☆358 · Updated this week
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project. ☆16 · Nov 7, 2025 · Updated 3 months ago
- A Slurm cluster for Kubernetes ☆68 · Jul 26, 2024 · Updated last year
- ☆33 · Updated this week
- Deploy a Flux MiniCluster to Kubernetes with the operator ☆40 · Jan 9, 2026 · Updated last month
- A toolkit for discovering cluster network topology. ☆99 · Feb 19, 2026 · Updated last week
- A simple example of running a MongoDB instance to query a database ☆10 · Aug 31, 2022 · Updated 3 years ago
- Slurm in Kubernetes ☆43 · Nov 20, 2025 · Updated 3 months ago
- scalable data movement in Exascale Supercomputers ☆17 · Updated this week
- ☆57 · Dec 12, 2025 · Updated 2 months ago
- KJob: Tool for CLI-loving ML researchers ☆41 · Dec 29, 2025 · Updated 2 months ago
- DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft… ☆64 · Updated this week
- NVIDIA DRA Driver for GPUs ☆574 · Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale ☆1,144 · Updated this week
- NVIDIA NCCL Tests for Distributed Training ☆137 · Updated this week
- ☆32 · Oct 31, 2025 · Updated 4 months ago
- ☆336 · Feb 22, 2026 · Updated last week
- Proposals and discussions for the AI Conformance Working Group. ☆19 · Dec 17, 2025 · Updated 2 months ago
- Scripts for viewing Slurm batch job resource usages ☆11 · Jan 3, 2022 · Updated 4 years ago
- CPU and GPU tutorial examples ☆13 · Apr 4, 2025 · Updated 10 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ☆13 · Jan 30, 2026 · Updated last month
- Optimized primitives for collective multi-GPU communication ☆10 · May 8, 2024 · Updated last year
- kubernetes object cache ☆13 · Mar 6, 2023 · Updated 2 years ago
- NVIDIA Network Operator ☆325 · Updated this week
- Vault <-> Kubernetes Secrets ☆12 · Jan 25, 2022 · Updated 4 years ago
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline ☆12 · Jul 6, 2023 · Updated 2 years ago
- ☆20 · Feb 5, 2026 · Updated 3 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆150 · Updated this week
- The Job Performance (SUPReMM) module for Open XDMoD. ☆11 · Jan 30, 2026 · Updated last month
- helm charts for deploying models with llm-d ☆28 · Updated this week
- ☆12 · May 30, 2025 · Updated 9 months ago
- InfiniBand SR-IOV CNI ☆13 · Feb 13, 2026 · Updated 2 weeks ago
- Universal Python SDK to run AI workloads on Kubernetes ☆80 · Updated this week
- This repo tracks all enhanced patches to the KuberWharf Kubernetes ☆31 · Oct 25, 2024 · Updated last year
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆120 · Updated this week
- Rabbitmq operator for kubernetes ☆13 · Jun 8, 2020 · Updated 5 years ago
- Prometheus exporter for the SLURM scheduler ☆19 · Oct 24, 2025 · Updated 4 months ago