GoogleCloudPlatform / cluster-toolkit
Cluster Toolkit is open-source software offered by Google Cloud that makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
☆235 Updated this week
Alternatives and similar repositories for cluster-toolkit:
Users who are interested in cluster-toolkit are comparing it to the repositories listed below:
- Slurm on Google Cloud Platform ☆183 Updated 6 months ago
- ☆38 Updated this week
- ☆35 Updated this week
- Container plugin for Slurm Workload Manager ☆327 Updated 4 months ago
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud. ☆47 Updated this week
- Azure CycleCloud project to enable users to create, configure, and use Slurm HPC clusters. ☆62 Updated this week
- MIG Partition Editor for NVIDIA GPUs ☆191 Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers orchestrate training jobs on accelerat… ☆107 Updated this week
- Tools to deploy GPU clusters in the Cloud ☆31 Updated last year
- A multi-platform experimentation framework written in Python. ☆48 Updated this week
- The Singularity implementation of the Kubernetes Container Runtime Interface ☆114 Updated 4 years ago
- ☆81 Updated 5 months ago
- Singularity implementation of k8s operator for interacting with SLURM. ☆117 Updated 4 years ago
- OCI-compatible engine to deploy Linux containers on HPC environments. ☆134 Updated 4 months ago
- Azure HPC/AI VM Images ☆103 Updated this week
- 📚 👨‍🔬 👩‍🔬 Discussion and advancement of Research Computing using Cloud Native technologies ☆78 Updated 2 years ago
- Core services for the Flux resource management framework ☆176 Updated this week
- Export select Slurm metrics to Prometheus ☆49 Updated this week
- HPC Container Maker ☆471 Updated last week
- RAD Lab enables users to deploy infrastructure on Google Cloud Platform (GCP) to support specific use cases. Infrastructure is created an… ☆99 Updated last week
- Deploy your HPC cluster on AWS in 20 min. with just 1 click. ☆63 Updated last year
- Fluxion Graph-based Scheduler ☆94 Updated last week
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads. ☆239 Updated this week
- ☆41 Updated last month
- A sample integration of AWS services with SLURM ☆73 Updated last year
- ☆43 Updated 2 months ago
- Rapid HPC Orchestration in the Cloud ☆28 Updated last year
- ☆97 Updated 5 months ago
- Dragon distributed runtime for HPC and AI applications and workflows ☆67 Updated 3 months ago
- Ansible role for installing and managing the Slurm Workload Manager ☆99 Updated 2 months ago