An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
☆290Mar 16, 2026Updated this week
Alternatives and similar repositories for omnia
Users that are interested in omnia are comparing it to the libraries listed below
Sorting:
- A Slurm-based HPC workload management environment, driven by Ansible.☆67Updated this week
- Prometheus exporter for performance metrics from Slurm.☆276Jun 20, 2024Updated last year
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆21Jul 11, 2022Updated 3 years ago
- Open source web interface for Slurm HPC & AI clusters☆550Mar 2, 2026Updated 2 weeks ago
- Ansible playbook for OpenHPC☆25Jun 11, 2019Updated 6 years ago
- My tools for the Slurm HPC workload manager☆570Mar 13, 2026Updated last week
- Ansible role for OpenHPC☆51Mar 2, 2026Updated 2 weeks ago
- HPC System and Software Testing Framework☆68Updated this week
- A Slurm cluster using docker-compose☆474Mar 12, 2026Updated last week
- Materials to teach terminal fundamentals for HPC users☆19Aug 18, 2021Updated 4 years ago
- Monitoring and visualization of InfiniBand Fabrics☆23Apr 19, 2021Updated 4 years ago
- OpenHPC Integration, Packaging, and Test Repo☆975Mar 14, 2026Updated last week
- LBNL Node Health Check☆275Apr 18, 2025Updated 11 months ago
- Singularity implementation of k8s operator for interacting with SLURM.☆117Dec 29, 2020Updated 5 years ago
- A coherent Ansible roles collection to simply deploy clusters of nodes.☆156Mar 13, 2026Updated last week
- Terraform modules to replicate the HPC user experience in the cloud☆163Feb 27, 2026Updated 3 weeks ago
- Tools for building GPU clusters☆1,424Feb 23, 2026Updated 3 weeks ago
- A modification to the slurm_showq code written by TACC.☆43Dec 9, 2024Updated last year
- Slurm spank plugin to give each job private /tmp (and/or other dirs)☆22Oct 16, 2025Updated 5 months ago
- A few utilities for use on a SLURM cluster☆44Updated this week
- Container plugin for Slurm Workload Manager☆422Feb 18, 2026Updated last month
- Ansible role for installing and managing the Slurm Workload Manager☆115Nov 24, 2025Updated 3 months ago
- Environment modules for NGC containers☆29Nov 19, 2021Updated 4 years ago
- SJTU HPC 开源项目:Spackenv (Spack ENVironment) switch environments between sysadmin, users and developers.☆22Jan 4, 2022Updated 4 years ago
- spart: a user-oriented partition info command for slurm☆24Sep 17, 2023Updated 2 years ago
- Slurm on Google Cloud Platform☆190Sep 18, 2024Updated last year
- ☆11Apr 5, 2024Updated last year
- CAST can enhance the system management of cluster-wide resources. It consists of the open source tools: cluster system management (CSM) a…☆27May 13, 2022Updated 3 years ago
- A collection of diamond collectors for slurm.☆17Apr 20, 2023Updated 2 years ago
- SLURM Tools and UBiLities☆74Aug 1, 2022Updated 3 years ago
- Storage Scale Installation and Configuration☆79Mar 13, 2026Updated last week
- HPC Container Maker☆512Mar 13, 2026Updated last week
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches☆74Nov 17, 2025Updated 4 months ago
- Slurm: A Highly Scalable Workload Manager☆3,795Mar 14, 2026Updated last week
- ☆15Nov 25, 2021Updated 4 years ago
- XALT: System tracking of users codes on clusters☆48Jan 22, 2026Updated last month
- RollingGantryCrane pulls and converts containers to LMOD modules☆12Oct 25, 2021Updated 4 years ago
- Spank Tunnels☆12Dec 1, 2015Updated 10 years ago
- A daemon that uses cgroups to monitor and manage user behavior on login nodes☆77Jan 23, 2026Updated last month