NVIDIA/cloudai

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/cloudai)

NVIDIA / cloudai

CloudAI Benchmark Framework

☆96

Alternatives and similar repositories for cloudai

Users that are interested in cloudai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mlcommons / chakra-old
View on GitHub
Repository for MLCommons Chakra schema and tools
☆38Dec 24, 2023Updated 2 years ago
mannheim-network / spacex
View on GitHub
The new generation of meta-universe web3.0 infrastructure, mannheim is a fast, distributed, and creator-friendly Blockchain for UGC devel…
☆10May 16, 2022Updated 4 years ago
facebookresearch / torch_ucc
View on GitHub
Pytorch process group third-party plugin for UCC
☆22Apr 15, 2024Updated 2 years ago
NVIDIA / nvloom
View on GitHub
nvloom is a set of tools designed to scalably test MNNVL fabrics.
☆50Apr 1, 2026Updated 3 months ago
NVIDIA / doroce-linux
View on GitHub
A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments
☆36Jul 25, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / param
View on GitHub
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆155Jul 2, 2026Updated 2 weeks ago
aws-samples / ec2-topology-aware-for-slurm
View on GitHub
☆13May 30, 2025Updated last year
romain-jacob / triscale
View on GitHub
TriScale software
☆14Apr 23, 2024Updated 2 years ago
networkop / cue-networking
View on GitHub
Example of using CUE to model baremetal network configurations
☆47Dec 30, 2021Updated 4 years ago
NVIDIA / topograph
View on GitHub
A toolkit for discovering cluster network topology.
☆144Updated this week
ParCoreLab / Snoopie
View on GitHub
Multi-GPU communication profiler and visualizer
☆43Jun 10, 2024Updated 2 years ago
mlcommons / chakra
View on GitHub
Repository for MLCommons Chakra schema and tools
☆185May 20, 2026Updated 2 months ago
Mellanox / ufm_sdk_3.0
View on GitHub
☆28Updated this week
cpmarvin / lnetd-ctl
View on GitHub
WIP for experimenting an opensource SR controller
☆14Mar 23, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Apstra / iba
View on GitHub
Apstra AOS IBA Probe Library
☆12Oct 5, 2020Updated 5 years ago
rtbrick / bgpdump2
View on GitHub
Bgpdump2: A Tool to Read and Compare the BGP RIB Dump Files.
☆16Jul 5, 2023Updated 3 years ago
ofiwg / fabtests
View on GitHub
FROZEN: the master branch has merged with the libfabric git repo
☆31Oct 3, 2018Updated 7 years ago
nleiva / gmessaging
View on GitHub
GPB and gRPC testing
☆14May 6, 2022Updated 4 years ago
microsoft / NPKit
View on GitHub
NCCL Profiling Kit
☆155Jul 1, 2024Updated 2 years ago
boweiliu / nccl
View on GitHub
Optimized primitives for collective multi-GPU communication
☆11May 8, 2024Updated 2 years ago
reiverjohn / biobench2
View on GitHub
Bioinformatics benchmarking package, based on the original BioBench developed by Albayraktaroglu et al, 2005
☆13Dec 13, 2018Updated 7 years ago
microsoft / superbenchmark
View on GitHub
A validation and profiling tool for AI infrastructure
☆382Updated this week
Wigner-GPU-Lab / SYCL-PRNG
View on GitHub
A pseudo random number generator library written against the SYCL API.
☆11Jun 11, 2019Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ekondis / gpuroofperf-toolkit
View on GitHub
A GPU performance prediction toolkit for CUDA programs
☆18Mar 25, 2019Updated 7 years ago
aliyun / aicb
View on GitHub
☆237Jul 2, 2026Updated 2 weeks ago
netbench / GPCNET
View on GitHub
☆44Jun 3, 2024Updated 2 years ago
openucx / ucc
View on GitHub
Unified Collective Communication Library
☆310Updated this week
MLNetwork / rostam
View on GitHub
☆25May 26, 2021Updated 5 years ago
JiaweiZhuang / aws-mpi-benchmark
View on GitHub
MPI Benchmark on AWS HPC cluster
☆20Jan 31, 2020Updated 6 years ago
argonne-lcf / dlio_benchmark
View on GitHub
An I/O benchmark for deep Learning applications
☆108Jun 18, 2026Updated last month
NVIDIA / nvidia-hpcg
View on GitHub
NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.
☆70Jul 7, 2026Updated last week
NVIDIA / nodewright
View on GitHub
A Kubernetes Operator to manage Node OS customizations.
☆58Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
poolpOrg / fion
View on GitHub
repository for the fion window manager
☆10Oct 10, 2025Updated 9 months ago
forresti / osu-micro-benchmarks
View on GitHub
MPI Microbenchmarks
☆48Apr 16, 2016Updated 10 years ago
NVIDIA / pyxis
View on GitHub
Container plugin for Slurm Workload Manager
☆453May 12, 2026Updated 2 months ago
mlcommons / training_results_v4.0
View on GitHub
This repository contains the results and code for the MLPerf™ Training v4.0 benchmark.
☆13Jun 11, 2024Updated 2 years ago
sflow-rt / containerlab
View on GitHub
Experiment with real-time network telemetry using containerlab
☆46Apr 18, 2026Updated 3 months ago
ntrdma / ntrdma
View on GitHub
Linux tree for ntrdma driver development.
☆11Jun 29, 2017Updated 9 years ago
intel / psm
View on GitHub
☆13Aug 4, 2022Updated 3 years ago