Slurm: A Highly Scalable Workload Manager
☆3,971May 15, 2026Updated this week
Alternatives and similar repositories for slurm
Users that are interested in slurm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MUNGE (MUNGE Uid 'N' Gid Emporium) is an authentication service for creating and validating user credentials.☆305Updated this week
- Open source web interface for Slurm HPC & AI clusters☆563Updated this week
- Python Interface to Slurm☆562Updated this week
- An HPC workload manager and job scheduler for desktops, clusters, and clouds.☆794Apr 10, 2026Updated last month
- My tools for the Slurm HPC workload manager☆578Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Container plugin for Slurm Workload Manager☆436May 12, 2026Updated last week
- Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy☆598May 10, 2026Updated last week
- LBNL Node Health Check☆279Apr 7, 2026Updated last month
- Slurm on Google Cloud Platform☆190Sep 18, 2024Updated last year
- Open MPI main development repository☆2,582Updated this week
- Singularity has been renamed to Apptainer as part of us moving the project to the Linux Foundation. This repo has been persisted as a sna…☆2,611Oct 10, 2022Updated 3 years ago
- OpenPMIx Project Repository☆262Updated this week
- Apptainer: Application containers for Linux☆1,833May 11, 2026Updated last week
- A Slurm cluster using docker-compose☆504May 8, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Prometheus exporter for performance metrics from Slurm.☆282Jun 20, 2024Updated last year
- OpenHPC Integration, Packaging, and Test Repo☆983Updated this week
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.☆937May 12, 2026Updated last week
- A flexible package manager that supports multiple versions, configurations, platforms, and compilers.☆5,045Updated this week
- Supercomputing. Seamlessly. Open, Interactive HPC Via the Web☆458Updated this week
- Steps to create a small slurm cluster with GPU enabled nodes☆272Feb 2, 2023Updated 3 years ago
- Optimized primitives for collective multi-GPU communication☆4,699Updated this week
- Environment Modules: provides dynamic modification of a user's environment☆846Mar 20, 2026Updated last month
- Tools for building GPU clusters☆1,435Apr 27, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆42,529Updated this week
- A Cloud Native Batch System (Project under CNCF)☆5,570Updated this week
- SingularityCE is the Community Edition of Singularity, an open source container platform designed to be simple, fast, and secure.☆962Updated this week
- core services for the Flux resource management framework☆200Updated this week
- Shifter - Linux Containers for HPC☆362Oct 25, 2025Updated 6 months ago
- Torque Repository☆263May 12, 2023Updated 3 years ago
- Singularity implementation of k8s operator for interacting with SLURM.☆118Dec 29, 2020Updated 5 years ago
- gather and plot data about Slurm scheduling and job statistics☆51Sep 23, 2014Updated 11 years ago
- Ansible role for installing and managing the Slurm Workload Manager☆118Nov 24, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆106Apr 6, 2026Updated last month
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆723Apr 21, 2026Updated 3 weeks ago
- NVIDIA device plugin for Kubernetes☆3,755Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,690Dec 1, 2025Updated 5 months ago
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆292Updated this week
- Official MPICH Repository☆676Updated this week
- Super Computing On Web☆325Updated this week