Azure / azhpc-imagesLinks
Azure HPC/AI VM Images
☆107Updated this week
Alternatives and similar repositories for azhpc-images
Users that are interested in azhpc-images are comparing it to the libraries listed below
Sorting:
- This repository provides easy automation scripts for building a HPC environment in Azure. It also includes examples to build e2e environm…☆130Updated 7 months ago
- Azure CycleCloud project to enable users to create, configure, and use Slurm HPC clusters.☆65Updated this week
- The Azure HPC On-Demand Platform provides an HPC Cluster Ready solution☆67Updated last month
- Health checks for Azure N- and H-series VMs.☆41Updated last month
- ☆165Updated last month
- A multi-platform experimentation framework written in python.☆53Updated last week
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 3 months ago
- Intel HPC Containers using Singularity☆19Updated 2 years ago
- A validation and profiling tool for AI infrastructure☆312Updated this week
- RCCL Performance Benchmark Tests☆67Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆86Updated this week
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆65Updated last week
- MPI benchmark to test and measure collective performance☆50Updated 3 years ago
- Dragon distributed runtime for HPC and AI applications and workflows☆72Updated 2 weeks ago
- Container plugin for Slurm Workload Manager☆343Updated 6 months ago
- Slurm Simulator: Slurm Modification to Enable its Simulation☆34Updated last year
- Python bindings for UCX☆135Updated this week
- Jobstats is a job monitoring platform for CPU and GPU clusters☆74Updated last month
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆173Updated this week
- ☆346Updated last year
- HPCPerfStats (formerly TACC Stats) is an automated resource-usage monitoring and analysis package for HPC Clusters.☆46Updated 2 weeks ago
- Darshan I/O characterization tool☆64Updated this week
- Tools to deploy GPU clusters in the Cloud☆31Updated 2 years ago
- MPI Microbenchmarks☆39Updated 9 years ago
- OCI-compatible engine to deploy Linux containers on HPC environments.☆139Updated 7 months ago
- NVIDIA NCCL Tests for Distributed Training☆91Updated 2 weeks ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 2 months ago
- Synthesizer for optimal collective communication algorithms☆106Updated last year
- Distributed AI/HPC Monitoring Framework☆27Updated last month
- Flux tutorial slides and materials☆18Updated this week