dswarbrick / fabricmon
InfiniBand fabric monitoring daemon written in Go
☆30Updated 10 months ago
Alternatives and similar repositories for fabricmon:
Users that are interested in fabricmon are comparing it to the libraries listed below
- Prometheus exporter for use with the Lustre parallel filesystem☆36Updated 2 years ago
- A Slurm-based HPC workload management environment, driven by Ansible.☆52Updated this week
- Bare Metal Provisioning system for HPC Linux clusters☆58Updated this week
- Slurm Lua SPANK plugin☆16Updated 2 years ago
- ☆13Updated 3 years ago
- Exposes Baseboard Management Controller data in Prometheus format.☆49Updated 3 months ago
- Prometheus exporter for a Infiniband Fabric☆57Updated last year
- Prometheus exporter for use with the Lustre parallel filesystem☆21Updated 3 months ago
- Scripts for monitoring InfiniBand and storage devices☆11Updated 9 years ago
- Lustre Monitoring System☆21Updated last year
- YAML-based database of datacenter infrastructures☆15Updated last month
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆28Updated 5 months ago
- Lustre Monitoring System based on Collectd, Grafana and Influxdb☆44Updated last year
- A tool to generate slurm topology configuration from infiniband network discovery.☆21Updated 8 years ago
- User Fencing Tools☆16Updated 2 years ago
- Monitoring and visualization of InfiniBand Fabrics☆20Updated 3 years ago
- Lustre Monitoring Tools☆72Updated 3 months ago
- Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches☆53Updated last month
- CLI tool for manipulating Ceph's upmap exception table.☆53Updated last month
- KNoC is a Kubernetes Virtual Kubelet that uses an HPC cluster as the container execution environment☆18Updated last year
- Lustre administration tool☆22Updated 6 months ago
- Kraken is a distributed state engine framework for scalable automation and orchestration tools.☆55Updated last year
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use.☆29Updated last month
- Spectrum Scale Installation and Configuration☆67Updated this week
- Integrated Manager for Lustre☆72Updated 3 years ago
- A distributed storage benchmark for file systems, object stores & block devices with support for GPUs☆180Updated this week
- Kerberos credential support for batch environments☆14Updated 6 months ago
- The operator manages the ovn-kube components running on the DPU card for enabling OVS hardware offloading.☆27Updated 2 months ago
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆19Updated 2 years ago
- ☆59Updated 4 months ago