bacaldwell / scalable-monitoring
Scripts for monitoring InfiniBand and storage devices
☆11Updated 9 years ago
Alternatives and similar repositories for scalable-monitoring:
Users that are interested in scalable-monitoring are comparing it to the libraries listed below
- InfiniBand fabric monitoring daemon written in Go☆30Updated last year
- A tool to generate slurm topology configuration from infiniband network discovery.☆21Updated 8 years ago
- Scan Singularity container images using a Clair server☆16Updated 3 years ago
- Lustre Monitoring Tools☆72Updated 5 months ago
- Prometheus exporter for use with the Lustre parallel filesystem☆37Updated 2 years ago
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆20Updated 2 years ago
- Prometheus exporter for use with the Lustre parallel filesystem☆22Updated 5 months ago
- OGRT Runtime Tracker☆11Updated 5 years ago
- A terminal based monitoring tool for InfiniBand networks using Detector (https://github.com/hhu-bsinfo/detector)☆13Updated 5 years ago
- Prometheus exporter for a Infiniband Fabric☆59Updated last year
- Dynamic Registry Proxy☆15Updated 2 years ago
- Cluster stack based on Salt☆18Updated 5 years ago
- Lester, the Lustre lister; quickly scan MDT to generate lists of file matching given criteria☆14Updated 3 years ago
- Enables HPC Environment in an OpenStack Cloud☆11Updated 7 years ago
- Full configuration of FredHutch Scratch File System using commodity disks, Ubuntu ZFS and BeeGFS☆10Updated 6 years ago
- Slurm SPANK plugin to let users change GPU compute mode in jobs☆12Updated 2 years ago
- Cerebro is a collection of cluster monitoring tools and libraries.☆17Updated 7 months ago
- stable lustre sources☆26Updated 5 years ago
- Some lustre-related scripts and utilities in use at LLNL.☆25Updated 2 months ago
- Create beegfs server and client☆24Updated 3 years ago
- ☆28Updated 5 years ago
- KNoC is a Kubernetes Virtual Kubelet that uses an HPC cluster as the container execution environment☆20Updated 2 years ago
- Custom Slurm tools☆25Updated 6 years ago
- A Slurm-based HPC workload management environment, driven by Ansible.☆56Updated this week
- User space POSIX-like file system in main memory☆38Updated 8 years ago
- Lustre administration tool☆22Updated 8 months ago
- continuous Lustre load monitor☆21Updated 9 years ago
- SLURM Bank, a collection of wrapper scripts to give slurm GOLD like capabilities for managing resources.☆24Updated 6 years ago
- Lustre Monitoring System☆23Updated 3 weeks ago
- A collection of diamond collectors for slurm.☆15Updated last year