GoogleCloudPlatform / cluster-toolkit
Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
☆188Updated this week
Related projects: ⓘ
- Slurm on Google Cloud Platform☆178Updated this week
- ☆35Updated 2 months ago
- ☆79Updated 3 months ago
- ☆24Updated last week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆69Updated this week
- Container plugin for Slurm Workload Manager☆278Updated last month
- A sample integration of AWS services with SLURM☆71Updated last year
- RAD Lab enables users to deploy infrastructure on Google Cloud Platform (GCP) to support specific use cases. Infrastructure is created an…☆94Updated this week
- Azure CycleCloud project to enable users to create, configure, and use Slurm HPC clusters.☆56Updated 3 weeks ago
- ☆35Updated 2 months ago
- Tools to deploy GPU clusters in the Cloud☆30Updated last year
- Terraform config for Cluster in the Cloud☆20Updated 5 months ago
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆217Updated last week
- Contains example recipes that demonstrate how to build HPC systems using AWS services and solutions.☆60Updated 3 weeks ago
- The Singularity implementation of the Kubernetes Container Runtime Interface☆114Updated 3 years ago
- Dragon distributed runtime for HPC and AI applications and workflows☆52Updated 2 months ago
- OCI-compatible engine to deploy Linux containers on HPC environments.☆129Updated 2 weeks ago
- A multi-platform experimentation framework written in python.☆38Updated this week
- Singularity implementation of k8s operator for interacting with SLURM.☆118Updated 3 years ago
- You should offer both Podman and Apptainer with name spaces on your HPC systems☆48Updated 5 months ago
- Rapid HPC Orchestration in the Cloud☆28Updated 11 months ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated 2 weeks ago
- core services for the Flux resource management framework☆167Updated this week
- Local filesystem registry for containers (intended for HPC) using Lmod or Environment Modules. Works for users and admins.☆111Updated 6 months ago
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆62Updated 7 months ago
- ☆37Updated 2 weeks ago
- HPC Container Maker☆447Updated this week
- Running High Performance Computing (HPA) applications on EKS using Elastic Fabric Adapter (EFA).☆8Updated 3 years ago
- GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.☆40Updated 9 months ago
- Singularity 101☆30Updated 4 years ago