LLNL / LaunchMON

LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a tool to identify all the remote processes of a job and to scalably launch daemons into the relevant nodes.
13Updated 2 years ago

Related projects

Alternatives and complementary repositories for LaunchMON