Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
☆340May 22, 2026Updated last week
Alternatives and similar repositories for cluster-toolkit
Users that are interested in cluster-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆66May 20, 2026Updated last week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆133May 22, 2026Updated last week
- ☆37Aug 12, 2025Updated 9 months ago
- Slurm on Google Cloud Platform☆190Sep 18, 2024Updated last year
- Virtualization Layer for the MPI Profiling Interface☆22Apr 30, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PyTorch implemetation of multi-layer quasi-geostrophic model on rectangular domain with solid boundaries.☆14May 9, 2022Updated 4 years ago
- Scripts to build AMD ROCm from source.☆16Oct 31, 2024Updated last year
- Notebooks with codes to do analysis for and make figures for "Submesoscale Vertical Velocities Enhance Tracer Subduction in an Idealized …☆12Sep 26, 2020Updated 5 years ago
- A source-to-source translator for OpenACC to OpenMP.☆16May 18, 2021Updated 5 years ago
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 4 years ago
- A multi-platform experimentation framework written in python.☆67May 22, 2026Updated last week
- Contains reference architecture scripts for running the OpenPiton regression using auto-scaling SLURM cluster.☆24Feb 25, 2026Updated 3 months ago
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆328Jun 23, 2025Updated 11 months ago
- MPI Benchmark on AWS HPC cluster☆20Jan 31, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Terraform modules for deploying DAOS on GCP☆11Jan 17, 2024Updated 2 years ago
- ☆63Mar 25, 2026Updated 2 months ago
- ☆57May 19, 2026Updated last week
- Tools to deploy GPU clusters in the Cloud☆34Apr 4, 2023Updated 3 years ago
- ☆49May 5, 2026Updated 3 weeks ago
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆183May 14, 2026Updated 2 weeks ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- Official BOLT Repository☆33Aug 16, 2024Updated last year
- Ansible role for OpenHPC☆52May 22, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Export select slurm metrics to prometheus☆66Feb 19, 2026Updated 3 months ago
- AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.☆886May 22, 2026Updated last week
- Contains example recipes that demonstrate how to build HPC systems using AWS services and solutions.☆95May 22, 2026Updated last week
- Open Source examples using Google Cloud to solve various Scientific and Technical Computing problems.☆25Apr 6, 2026Updated last month
- Deploys a large data sharing Golang web app☆14May 10, 2024Updated 2 years ago
- ☆15May 12, 2026Updated 2 weeks ago
- Container plugin for Slurm Workload Manager☆447May 12, 2026Updated 2 weeks ago
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.☆942May 19, 2026Updated last week
- ☆16Mar 13, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Create a secure ML environment on Vertex AI☆37May 22, 2026Updated last week
- An external library for delivering Slurm Elastic Computing.☆12Mar 31, 2017Updated 9 years ago
- ☆13Jun 18, 2024Updated last year
- ☆12Jun 11, 2024Updated last year
- ☆15Jun 8, 2017Updated 8 years ago
- Standalone Spack Tutorial Repository☆52May 20, 2026Updated last week
- Run Slurm on Kubernetes. A Slinky project.☆301May 22, 2026Updated last week