Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
☆323Mar 2, 2026Updated this week
Alternatives and similar repositories for cluster-toolkit
Users that are interested in cluster-toolkit are comparing it to the libraries listed below
Sorting:
- ☆61Updated this week
- Slurm on Google Cloud Platform☆190Sep 18, 2024Updated last year
- ☆37Aug 12, 2025Updated 6 months ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆27Jan 9, 2026Updated last month
- ☆48Jan 5, 2026Updated last month
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 3 years ago
- A multi-platform experimentation framework written in python.☆64Feb 21, 2026Updated last week
- PyTorch implemetation of multi-layer quasi-geostrophic model on rectangular domain with solid boundaries.☆14May 9, 2022Updated 3 years ago
- ☆61Jan 20, 2026Updated last month
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆170Updated this week
- ☆16Mar 13, 2025Updated 11 months ago
- Scripts to build AMD ROCm from source.☆16Oct 31, 2024Updated last year
- Contains example recipes that demonstrate how to build HPC systems using AWS services and solutions.☆90Feb 10, 2026Updated 2 weeks ago
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆327Jun 23, 2025Updated 8 months ago
- Container plugin for Slurm Workload Manager☆416Feb 18, 2026Updated last week
- ☆17Apr 9, 2025Updated 10 months ago
- Deploys a large data sharing Golang web app☆14May 10, 2024Updated last year
- Open Source examples using Google Cloud to solve various Scientific and Technical Computing problems.☆23Updated this week
- DXT Explorer is an interactive web-based log analysis tool for Darshan DXT logs.☆17Feb 19, 2026Updated last week
- AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.☆886Updated this week
- Benchmarks☆18Apr 28, 2025Updated 10 months ago
- ☆57Dec 12, 2025Updated 2 months ago
- Fluxion Graph-based Scheduler☆107Feb 10, 2026Updated 2 weeks ago
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆34Feb 17, 2024Updated 2 years ago
- Ansible role for OpenHPC☆51Updated this week
- Standalone Spack Tutorial Repository☆52Jan 25, 2026Updated last month
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.☆903Feb 18, 2026Updated last week
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆21Jul 11, 2022Updated 3 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- FIX metadata model☆11Nov 18, 2015Updated 10 years ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated this week
- Julia HPC miniapp using parallel models (MPI.jl, CUDA.jl, AMDGPU.jl, ADIOS2.jl) and Jupyter/Pluto.jl notebooks☆24Jan 28, 2026Updated last month
- A project and machine deployment model using Spack☆29Feb 5, 2026Updated 3 weeks ago
- ☆10May 30, 2020Updated 5 years ago
- A testing framework and a set of test suites used for testing GCE Images.☆15Feb 24, 2026Updated last week
- ☆13Jun 18, 2024Updated last year
- ExaWorks SDK☆11Feb 1, 2024Updated 2 years ago
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 2 years ago