cluslab / metastackLinks
Metastack: an enhanced and performance optimized version of Slurm
☆53Updated this week
Alternatives and similar repositories for metastack
Users that are interested in metastack are comparing it to the libraries listed below
Sorting:
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 9 months ago
- ☆15Updated 4 months ago
- ☆172Updated 2 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆63Updated last month
- Super Computing On Web☆304Updated this week
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆103Updated last month
- A distributed scheduling system for HPC and AI workloads☆125Updated this week
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆34Updated 2 months ago
- MANA for MPI☆45Updated 2 months ago
- core services for the Flux resource management framework☆189Updated last week
- Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images☆129Updated 6 years ago
- Fluxion Graph-based Scheduler☆101Updated 2 weeks ago
- OCI-compatible engine to deploy Linux containers on HPC environments.☆141Updated last year
- UnifyFS: A file system for burst buffers☆118Updated 2 months ago
- Slurm Simulator: Slurm Modification to Enable its Simulation☆37Updated last year
- A Slurm cluster using docker-compose☆415Updated this week
- HPC Monitoring Tool☆35Updated last week
- ☆133Updated 2 weeks ago
- nvloom is a set of tools designed to scalably test MNNVL fabrics.☆33Updated 3 weeks ago
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node☆43Updated 2 months ago
- oneAPI Collective Communications Library (oneCCL)☆247Updated 3 weeks ago
- Heavy Peer To Peer: a MPI based benchmark for network diagnostic☆24Updated 8 months ago
- Intel HPC Containers using Singularity☆19Updated 2 years ago
- File utilities designed for scalability and performance.☆190Updated 3 months ago
- HPC Container Maker☆499Updated last month
- Unified Collective Communication Library☆279Updated this week
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project.☆14Updated 3 weeks ago
- Bandwidth test for ROCm☆69Updated last week
- OpenPMIx Project Repository☆252Updated last week
- MPI Microbenchmarks☆46Updated 9 years ago