NVIDIA/ib-traffic-monitor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/ib-traffic-monitor)

NVIDIA / ib-traffic-monitor

A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node

☆71

Alternatives and similar repositories for ib-traffic-monitor

Users that are interested in ib-traffic-monitor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rackslab / slurm-quota
View on GitHub
CPU/GPU time quotas for users & accounts in Slurm
☆17Jul 9, 2026Updated last week
NVIDIA / dgxc-benchmarking
View on GitHub
DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft…
☆98Jul 6, 2026Updated 2 weeks ago
cea-hpc / pcvs-benchmarks
View on GitHub
Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks
☆16Mar 26, 2026Updated 3 months ago
stanford-rc / ibswinfo
View on GitHub
Command-line tool to retrieve information and monitor Mellanox un-managed Infiniband switches
☆77Nov 17, 2025Updated 8 months ago
eunomia-bpf / nccl-eBPF
View on GitHub
☆20Jul 7, 2026Updated last week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
stackhpc / ansible-slurm-appliance
View on GitHub
A Slurm-based HPC workload management environment, driven by Ansible.
☆72Updated this week
infiniband-radar / infiniband-radar-daemon
View on GitHub
☆15Nov 25, 2021Updated 4 years ago
icl-utk-edu / hpl
View on GitHub
☆16Jul 25, 2021Updated 4 years ago
ishandhanani / srt-slurm
View on GitHub
Benchmark SGLang on SLURM
☆24Apr 20, 2026Updated 3 months ago
MrBr-github / lshca
View on GitHub
☆13Mar 3, 2025Updated last year
thediymaker / slurm-node-dashboard
View on GitHub
Slurm HPC node status page
☆75Jul 6, 2026Updated 2 weeks ago
hmxlabs / hpc-catalog
View on GitHub
A community driven catalog of tools and products that are useful in the world of high performance computing (HPC)
☆11Jul 3, 2025Updated last year
FindHao / drgpu
View on GitHub
A Top-Down Profiler for GPU Applications
☆23Feb 29, 2024Updated 2 years ago
takahiro-hirofuchi / mesmeric-emulator
View on GitHub
MESMERIC: A Software-based NVM Emulator Supporting Read/Write Asymmetric Latencies
☆10Oct 1, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVIDIA / gpu_affinity
View on GitHub
GPU Affinity is a package to automatically set the CPU process affinity to match the hardware architecture on a given platform
☆29Dec 8, 2023Updated 2 years ago
TiledTensor / TiledBench
View on GitHub
Benchmark tests supporting the TiledCUDA library.
☆19Nov 19, 2024Updated last year
xjdr-alt / mla_blog_translation
View on GitHub
☆13Jun 18, 2024Updated 2 years ago
stanford-rc / slurm-spank-stunnel
View on GitHub
Slurm SPANK plugin to ease setup of SSH tunnels and port forwarding
☆12Mar 21, 2024Updated 2 years ago
jhammond / ibtop
View on GitHub
monitor InfiniBand usage by job or host
☆26Jan 11, 2012Updated 14 years ago
ecrc / hicma
View on GitHub
HiCMA: Hierarchical Computations on Manycore Architectures
☆37Mar 19, 2023Updated 3 years ago
Infrawaves / DeepEP_ibrc_dual-ports_multiQP
View on GitHub
Aims to implement dual-port and multi-qp solutions in deepEP ibrc transport
☆75May 9, 2025Updated last year
NVIDIA / nv-one-logger
View on GitHub
nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster ineffici…
☆24Nov 6, 2025Updated 8 months ago
stanford-rc / slurm-spank-lua
View on GitHub
Slurm Lua SPANK plugin
☆17Jan 30, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVIDIA / ansible-collection-dpu-ops
View on GitHub
NVIDIA DPU OPs collection
☆15Mar 6, 2023Updated 3 years ago
mxinden / elimination-backoff-stack
View on GitHub
Lock-free elimination back-off stack
☆12Jan 6, 2022Updated 4 years ago
howardlau1999 / rdmapp
View on GitHub
C++ interfaces for RDMA access
☆84Jul 13, 2026Updated last week
GSI-HPC / lustre_exporter
View on GitHub
Prometheus exporter for use with the Lustre parallel filesystem
☆30Jul 1, 2026Updated 2 weeks ago
whamcloud / lustrefs-exporter
View on GitHub
Prometheus exporter for lustre
☆27Updated this week
llnl / lustre
View on GitHub
LLNL's branches of Lustre
☆63May 29, 2026Updated last month
breuner / elfindo
View on GitHub
A parallel find tool for Linux
☆16Dec 20, 2025Updated 7 months ago
NVIDIA / srt-slurm
View on GitHub
NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across speci…
☆40Updated this week
breuner / elbencho
View on GitHub
A distributed storage benchmark for file systems, object stores & block devices with support for GPUs
☆281Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bsc-pm / dlb
View on GitHub
DLB (Dynamic Load Balancing) library is a tool, transparent to the user, that will dynamically react to the application imbalance modifyi…
☆32Updated this week
cyberang3l / InfiniBand-Graphviz-ualization
View on GitHub
Generate graphviz dot files from InfiniBand topology dumps.
☆17Feb 11, 2024Updated 2 years ago
meta-pytorch / torchft
View on GitHub
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆523Updated this week
buzh / slop
View on GitHub
A `top`-like utility for the Slurm HPC batch job scheduler
☆15Jun 9, 2026Updated last month
JiangLiSJTU / token-ring
View on GitHub
☆13Jan 7, 2025Updated last year
guilbaults / slurm-job-exporter
View on GitHub
Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.
☆49Apr 15, 2026Updated 3 months ago
NVIDIA / go-nvlib
View on GitHub
A collection of useful Go libraries for use with NVIDIA GPU management tools
☆57Updated this week