Cluster Toolkit is open-source software offered by Google Cloud that makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
☆346 Jun 15, 2026 Updated this week
Alternatives and similar repositories for cluster-toolkit
Users that are interested in cluster-toolkit are comparing it to the libraries listed below.
- ☆68 Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud. ☆133 Jun 8, 2026 Updated last week
- ☆37 Aug 12, 2025 Updated 10 months ago
- Slurm on Google Cloud Platform ☆191 Sep 18, 2024 Updated last year
- Virtualization Layer for the MPI Profiling Interface ☆22 Apr 30, 2022 Updated 4 years ago
- Scripts to build AMD ROCm from source. ☆16 Oct 31, 2024 Updated last year
- Notebooks with codes to do analysis for and make figures for "Submesoscale Vertical Velocities Enhance Tracer Subduction in an Idealized … ☆12 Sep 26, 2020 Updated 5 years ago
- A source-to-source translator for OpenACC to OpenMP. ☆16 May 18, 2021 Updated 5 years ago
- A Template for MLOps on Google Cloud Vertex AI ☆13 Mar 16, 2022 Updated 4 years ago
- ☆35 Oct 31, 2025 Updated 7 months ago
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces ☆30 Jan 9, 2026 Updated 5 months ago
- A multi-platform experimentation framework written in python. ☆69 Jun 11, 2026 Updated last week
- Contains reference architecture scripts for running the OpenPiton regression using auto-scaling SLURM cluster. ☆22 Feb 25, 2026 Updated 3 months ago
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub… ☆328 Jun 23, 2025 Updated 11 months ago
- MPI Benchmark on AWS HPC cluster ☆20 Jan 31, 2020 Updated 6 years ago
- Terraform modules for deploying DAOS on GCP ☆11 Jan 17, 2024 Updated 2 years ago
- ☆57 Jun 8, 2026 Updated last week
- Tools to deploy GPU clusters in the Cloud ☆34 Apr 4, 2023 Updated 3 years ago
- ☆49 May 5, 2026 Updated last month
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers orchestrate training jobs on accelerat… ☆183 Jun 11, 2026 Updated last week
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP ☆11 Jan 29, 2024 Updated 2 years ago
- Official BOLT Repository ☆33 Aug 16, 2024 Updated last year
- Ansible role for OpenHPC ☆51 May 22, 2026 Updated 3 weeks ago
- A utility for connecting securely to your AlloyDB instances ☆76 Jun 10, 2026 Updated last week
- Export select slurm metrics to prometheus ☆66 Feb 19, 2026 Updated 3 months ago
- ☆106 Jun 11, 2026 Updated last week
- AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud. ☆885 Updated this week
- Contains example recipes that demonstrate how to build HPC systems using AWS services and solutions. ☆96 May 22, 2026 Updated 3 weeks ago
- Open Source examples using Google Cloud to solve various Scientific and Technical Computing problems. ☆25 Apr 6, 2026 Updated 2 months ago
- Deploys a large data sharing Golang web app ☆14 May 10, 2024 Updated 2 years ago
- Benchmarks ☆19 Jun 3, 2026 Updated 2 weeks ago
- Container plugin for Slurm Workload Manager ☆452 May 12, 2026 Updated last month
- ☆22 Nov 19, 2025 Updated 6 months ago
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes. ☆963 Jun 9, 2026 Updated last week
- TPU inference for vLLM, with unified JAX and PyTorch support. ☆352 Updated this week
- ☆16 Mar 13, 2025 Updated last year
- An external library for delivering Slurm Elastic Computing. ☆12 Mar 31, 2017 Updated 9 years ago
- ☆13 Jun 18, 2024 Updated 2 years ago
- ☆12 Jun 11, 2024 Updated 2 years ago