An open-source toolkit for deploying and managing high-performance clusters for HPC, AI, and data analytics workloads.
☆290Updated this week
Alternatives and similar repositories for omnia
Users who are interested in omnia are comparing it to the repositories listed below
- A Slurm-based HPC workload management environment, driven by Ansible.☆67Updated this week
- Prometheus exporter for performance metrics from Slurm.☆275Jun 20, 2024Updated last year
- Ansible role for OpenHPC☆51Updated this week
- Open source web interface for Slurm HPC & AI clusters☆545Feb 13, 2026Updated 2 weeks ago
- My tools for the Slurm HPC workload manager☆569Updated this week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisons - from StackHPC Ltd.☆21Jul 11, 2022Updated 3 years ago
- HPC System and Software Testing Framework☆68Updated this week
- Ansible role for installing and managing the Slurm Workload Manager☆113Nov 24, 2025Updated 3 months ago
- Monitoring and visualization of InfiniBand Fabrics☆23Apr 19, 2021Updated 4 years ago
- A Slurm cluster using docker-compose☆465Feb 21, 2026Updated last week
- LBNL Node Health Check☆271Apr 18, 2025Updated 10 months ago
- Singularity implementation of k8s operator for interacting with SLURM.☆117Dec 29, 2020Updated 5 years ago
- Slurm spank plugin to give each job private /tmp (and/or other dirs)☆22Oct 16, 2025Updated 4 months ago
- Container plugin for Slurm Workload Manager☆416Feb 18, 2026Updated last week
- Ansible playbook for OpenHPC☆25Jun 11, 2019Updated 6 years ago
- OpenHPC Integration, Packaging, and Test Repo☆971Updated this week
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆43Jan 29, 2026Updated last month
- A few utilities for use on a SLURM cluster☆44Aug 26, 2025Updated 6 months ago
- SJTU HPC open-source project: Spackenv (Spack ENVironment) switches environments between sysadmins, users, and developers.☆22Jan 4, 2022Updated 4 years ago
- SLURM Tools and UBiLities☆74Aug 1, 2022Updated 3 years ago
- Terraform modules to replicate the HPC user experience in the cloud☆164Feb 20, 2026Updated last week
- Tools for building GPU clusters☆1,421Updated this week
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches☆74Nov 17, 2025Updated 3 months ago
- A coherent Ansible roles collection to simply deploy clusters of nodes.☆156Feb 19, 2026Updated last week
- Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual syste…☆624Updated this week
- Slurm on Google Cloud Platform☆190Sep 18, 2024Updated last year
- ☆15Nov 25, 2021Updated 4 years ago
- ☆17Jul 25, 2025Updated 7 months ago
- spart: a user-oriented partition info command for slurm☆24Sep 17, 2023Updated 2 years ago
- A daemon that uses cgroups to monitor and manage user behavior on login nodes☆76Jan 23, 2026Updated last month
- Storage Scale Installation and Configuration☆79Feb 20, 2026Updated last week
- ☆11Apr 5, 2024Updated last year
- Materials to teach terminal fundamentals for HPC users☆19Aug 18, 2021Updated 4 years ago
- A collection of diamond collectors for slurm.☆17Apr 20, 2023Updated 2 years ago
- HPC Container Maker☆508Feb 19, 2026Updated last week
- A modification to the slurm_showq code written by TACC.☆43Dec 9, 2024Updated last year
- Share Spack configuration files with other HPC sites☆70Mar 24, 2025Updated 11 months ago
- GitLab runner for HPC systems using ENROOT and SLURM☆33Sep 28, 2023Updated 2 years ago
- OSISM documentation☆11Mar 2, 2024Updated last year