cea-hpc / clustershell
Scalable cluster administration Python framework — Manage node sets, node groups and execute commands on cluster nodes in parallel.
☆425 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for clustershell
- LBNL Node Health Check ☆232 · Updated last week
- MUNGE (MUNGE Uid 'N' Gid Emporium) is an authentication service for creating and validating user credentials. ☆253 · Updated 3 months ago
- Robinhood Policy Engine: a versatile tool to monitor filesystem contents and schedule actions on filesystem entries. ☆181 · Updated last month
- Open source web dashboard for Slurm HPC clusters ☆338 · Updated this week
- Ganglia Monitoring core ☆491 · Updated 2 years ago
- A high performance, parallel remote shell utility ☆487 · Updated 2 months ago
- IOR and mdtest ☆384 · Updated 2 months ago
- Prometheus exporter for performance metrics from Slurm. ☆236 · Updated 5 months ago
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches ☆49 · Updated 5 months ago
- Code repo for xCAT core packages ☆369 · Updated last week
- Sort files and pack them into partitions ☆232 · Updated last month
- File utilities designed for scalability and performance. ☆170 · Updated this week
- Torque Repository ☆251 · Updated last year
- gather and plot data about Slurm scheduling and job statistics ☆50 · Updated 10 years ago
- dcp is a distributed file copy program that automatically distributes and dynamically balances work equally across nodes in a large distr… ☆195 · Updated 5 years ago
- Shifter - Linux Containers for HPC ☆352 · Updated 7 months ago
- Lustre administration tool ☆22 · Updated 3 months ago
- MarFS provides a scalable near-POSIX file system by using one or more POSIX file systems as a scalable metadata component and one or more… ☆97 · Updated 2 weeks ago
- A distributed storage benchmark for file systems, object stores & block devices with support for GPUs ☆171 · Updated 3 weeks ago
- Parallel SSH Tools ☆285 · Updated last year
- Ganglia Web Frontend ☆316 · Updated 6 months ago
- Prometheus exporter for use with the Lustre parallel filesystem ☆36 · Updated 2 years ago
- ☆60 · Updated 2 months ago
- Now hosted on GitLab. ☆312 · Updated last month
- A daemon that uses cgroups to monitor and manage user behavior on login nodes ☆60 · Updated 3 months ago
- Warewulf is a scalable systems management suite originally developed to manage large high-performance Linux clusters. ☆107 · Updated 7 months ago
- ConMan: The Console Manager ☆103 · Updated 8 months ago
- A coherent Ansible roles collection to simply deploy clusters of nodes. ☆115 · Updated this week
- Run VMs on an HPC cluster ☆47 · Updated 6 months ago
- Python Interface to Slurm ☆492 · Updated this week