Azure / azhpc-images
Azure HPC/AI VM Images
☆103Updated this week
Alternatives and similar repositories for azhpc-images:
Users that are interested in azhpc-images are comparing it to the libraries listed below
- This repository provides easy automation scripts for building a HPC environment in Azure. It also includes examples to build e2e environm…☆129Updated 4 months ago
- Azure CycleCloud project to enable users to create, configure, and use Slurm HPC clusters.☆62Updated this week
- The Azure HPC On-Demand Platform provides an HPC Cluster Ready solution☆65Updated last week
- Health checks for Azure N- and H-series VMs.☆35Updated this week
- ☆162Updated 2 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆47Updated last month
- Intel HPC Containers using Singularity☆19Updated 2 years ago
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆167Updated this week
- MIG Partition Editor for NVIDIA GPUs☆191Updated this week
- A multi-platform experimentation framework written in python.☆48Updated this week
- RCCL Performance Benchmark Tests☆60Updated 2 weeks ago
- MPI benchmark to test and measure collective performance☆50Updated 3 years ago
- Tools to deploy GPU clusters in the Cloud☆31Updated last year
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆64Updated last week
- Container plugin for Slurm Workload Manager☆327Updated 4 months ago
- MPI Microbenchmarks☆37Updated 8 years ago
- Dragon distributed runtime for HPC and AI applications and workflows☆67Updated 3 months ago
- ROCm Communication Collectives Library (RCCL)☆308Updated this week
- RDC☆27Updated this week
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- OCI-compatible engine to deploy Linux containers on HPC environments.☆135Updated 4 months ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆102Updated last week
- A validation and profiling tool for AI infrastructure☆302Updated this week
- ☆331Updated 11 months ago
- HPC System and Software Testing Framework☆68Updated 3 weeks ago
- oneAPI Collective Communications Library (oneCCL)☆227Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆48Updated this week
- Flux tutorial slides and materials☆16Updated 3 weeks ago
- LBNL Node Health Check☆246Updated last month
- RDMA and SHARP plugins for nccl library☆184Updated this week