Azure / azhpc-images
Azure HPC/AI VM Images
☆101 · Updated last week
Alternatives and similar repositories for azhpc-images:
Users who are interested in azhpc-images are comparing it to the repositories listed below.
- This repository provides easy automation scripts for building an HPC environment in Azure. It also includes examples to build e2e environm… ☆128 · Updated 3 months ago
- Azure CycleCloud project to enable users to create, configure, and use Slurm HPC clusters. ☆62 · Updated this week
- The Azure HPC On-Demand Platform provides an HPC Cluster Ready solution ☆65 · Updated 2 weeks ago
- Health checks for Azure N- and H-series VMs. ☆32 · Updated 2 weeks ago
- A multi-platform experimentation framework written in python. ☆47 · Updated this week
- RCCL Performance Benchmark Tests ☆59 · Updated last month
- ☆159 · Updated last month
- Reference implementations of MLPerf™ HPC training benchmarks ☆45 · Updated 8 months ago
- Dragon distributed runtime for HPC and AI applications and workflows ☆65 · Updated 2 months ago
- Intel HPC Containers using Singularity ☆19 · Updated 2 years ago
- A validation and profiling tool for AI infrastructure ☆294 · Updated this week
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal. ☆16 · Updated 3 years ago
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications. ☆163 · Updated this week
- A tool for bandwidth measurements on NVIDIA GPUs. ☆364 · Updated 2 weeks ago
- Fluxion Graph-based Scheduler ☆92 · Updated last week
- HPCPerfStats is an automated resource-usage monitoring and analysis package. ☆46 · Updated this week
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability… ☆101 · Updated 3 months ago
- ☆37 · Updated 8 months ago
- Tools to deploy GPU clusters in the Cloud ☆30 · Updated last year
- Darshan I/O characterization tool ☆61 · Updated this week
- Magnum IO community repo ☆84 · Updated last month
- RDC ☆26 · Updated this week
- Microsoft Collective Communication Library ☆62 · Updated 3 months ago
- RDMA and SHARP plugins for nccl library ☆176 · Updated last month
- ROCm Communication Collectives Library (RCCL) ☆297 · Updated this week
- Container plugin for Slurm Workload Manager ☆320 · Updated 3 months ago
- ☆19 · Updated 3 months ago
- Integrated Performance Monitoring for High Performance Computing ☆88 · Updated 3 years ago
- Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems. ☆44 · Updated this week
- An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments ☆37 · Updated this week