Slurm: A Highly Scalable Workload Manager
☆3,928Apr 24, 2026Updated this week
Alternatives and similar repositories for slurm
Users that are interested in slurm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MUNGE (MUNGE Uid 'N' Gid Emporium) is an authentication service for creating and validating user credentials.☆304Apr 16, 2026Updated last week
- Open source web interface for Slurm HPC & AI clusters☆556Apr 16, 2026Updated last week
- Python Interface to Slurm☆560Updated this week
- An HPC workload manager and job scheduler for desktops, clusters, and clouds.☆791Apr 10, 2026Updated 2 weeks ago
- My tools for the Slurm HPC workload manager☆577Apr 17, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Container plugin for Slurm Workload Manager☆429Updated this week
- Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy☆591Updated this week
- LBNL Node Health Check☆278Apr 7, 2026Updated 2 weeks ago
- Slurm on Google Cloud Platform☆190Sep 18, 2024Updated last year
- Open MPI main development repository☆2,565Apr 18, 2026Updated last week
- Singularity has been renamed to Apptainer as part of us moving the project to the Linux Foundation. This repo has been persisted as a sna…☆2,613Oct 10, 2022Updated 3 years ago
- OpenPMIx Project Repository☆260Apr 16, 2026Updated last week
- Apptainer: Application containers for Linux☆1,814Apr 17, 2026Updated last week
- A Slurm cluster using docker-compose☆496Apr 9, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Prometheus exporter for performance metrics from Slurm.☆280Jun 20, 2024Updated last year
- OpenHPC Integration, Packaging, and Test Repo☆979Updated this week
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.☆930Apr 15, 2026Updated last week
- A flexible package manager that supports multiple versions, configurations, platforms, and compilers.☆5,007Apr 20, 2026Updated last week
- Supercomputing. Seamlessly. Open, Interactive HPC Via the Web☆449Updated this week
- Steps to create a small slurm cluster with GPU enabled nodes☆272Feb 2, 2023Updated 3 years ago
- Optimized primitives for collective multi-GPU communication☆4,640Updated this week
- Environment Modules: provides dynamic modification of a user's environment☆844Mar 20, 2026Updated last month
- Tools for building GPU clusters☆1,430Feb 23, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆42,275Updated this week
- A Cloud Native Batch System (Project under CNCF)☆5,494Updated this week
- SingularityCE is the Community Edition of Singularity, an open source container platform designed to be simple, fast, and secure.☆956Apr 20, 2026Updated last week
- core services for the Flux resource management framework☆200Updated this week
- Shifter - Linux Containers for HPC☆362Oct 25, 2025Updated 6 months ago
- Torque Repository☆264May 12, 2023Updated 2 years ago
- Singularity implementation of k8s operator for interacting with SLURM.☆117Dec 29, 2020Updated 5 years ago
- Ansible role for installing and managing the Slurm Workload Manager☆117Nov 24, 2025Updated 5 months ago
- gather and plot data about Slurm scheduling and job statistics☆52Sep 23, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆104Apr 6, 2026Updated 3 weeks ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆708Updated this week
- NVIDIA device plugin for Kubernetes☆3,729Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,694Dec 1, 2025Updated 4 months ago
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆291Apr 20, 2026Updated last week
- Official MPICH Repository☆672Updated this week
- Super Computing On Web☆320Apr 19, 2026Updated last week