DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and software combinations.
☆97May 23, 2026Updated last month
Alternatives and similar repositories for dgxc-benchmarking
Users that are interested in dgxc-benchmarking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13May 30, 2025Updated last year
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node☆69May 16, 2026Updated last month
- nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster ineffici…☆23Nov 6, 2025Updated 7 months ago
- Linux Sysinfo Snapshot☆66Jun 7, 2026Updated 3 weeks ago
- A community driven catalog of tools and products that are useful in the world of high performance computing (HPC)☆11Jul 3, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆90May 12, 2026Updated last month
- nvloom is a set of tools designed to scalably test MNNVL fabrics.☆49Apr 1, 2026Updated 3 months ago
- InfiniBand fabric monitoring daemon written in Go☆32May 22, 2025Updated last year
- A Slurm-based HPC workload management environment, driven by Ansible.☆72Jun 25, 2026Updated last week
- A toolkit for discovering cluster network topology.☆135Updated this week
- OLCF Test Harness☆14May 26, 2026Updated last month
- NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou…☆16Jul 20, 2022Updated 3 years ago
- Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks☆16Mar 26, 2026Updated 3 months ago
- Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes☆339Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- VASTPY is the official Python SDK for the VAST Management System☆21Mar 26, 2026Updated 3 months ago
- Scripts to customize AWS ParallelCluster☆29Jun 11, 2026Updated 3 weeks ago
- Optimized primitives for collective multi-GPU communication☆11May 8, 2024Updated 2 years ago
- Python wrappers for the FirecREST API☆12Jun 21, 2026Updated last week
- A distributed storage benchmark for file systems, object stores & block devices with support for GPUs☆274Jun 21, 2026Updated last week
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- Utility for monitoring process, thread, OS and HW resources.☆20May 20, 2026Updated last month
- CGFDM3D-EQR: A Platform for Rapid Response to Earthquake Disasters in 3D Complex Media☆21May 29, 2023Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark.☆13Jun 11, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PLASMA parallel library for dense linear algebra.☆10May 30, 2017Updated 9 years ago
- Multi-GPU communication profiler and visualizer☆42Jun 10, 2024Updated 2 years ago
- RPerf: Accurate Latency Measurement Framework for RDMA☆15Apr 14, 2026Updated 2 months ago
- Aries Network Performance Counters Monitoring Library☆11Nov 19, 2020Updated 5 years ago
- Empirical-Research Toolkit☆11Jun 21, 2026Updated last week
- Information for the Intro to Cluster System Administration for Non-Sysadmins class☆10Dec 12, 2021Updated 4 years ago
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆24Apr 26, 2018Updated 8 years ago
- ☆10Dec 18, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pavilion is a Python 3 (3.6+) based framework for running and analyzing tests targeting HPC systems.☆46Jun 25, 2026Updated last week
- Show differences between directory trees☆15Aug 9, 2025Updated 10 months ago
- a model of deepfm using keras☆12Apr 2, 2019Updated 7 years ago
- pytorch code examples for measuring the performance of collective communication calls in AI workloads☆21Sep 18, 2025Updated 9 months ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆12Sep 15, 2025Updated 9 months ago
- A complete CUDA tutorial ranging from first GPU programs to advanced asynchronous methods☆30Jan 22, 2026Updated 5 months ago