deepsquare-io / ClusterFactory
Kubernetes-based infrastructure orchestration tool that automate the process of deploying, managing and monitoring compute-optimized clusters from bare metal servers to VMs and containers.
☆32Updated this week
Related projects ⓘ
Alternatives and complementary repositories for ClusterFactory
- The DeepSquare Grid is a decentralized HPC based on Blockchain, in Solidity and Go, with an abstracted SLURM interface and meta-schedulin…☆11Updated 2 months ago
- DeepSquare Workflow Catalog, including both starter workflow files and community-contributed ones.☆11Updated 8 months ago
- User-facing portal to all DeepSquare applications running on the Grid☆10Updated last week
- DeepSquare typescript SDK☆10Updated 9 months ago
- A Slurm-based HPC workload management environment, driven by Ansible.☆51Updated this week
- Prometheus exporter for the stats in the cgroup accounting with slurm. This will also collect stats of a job using NVIDIA GPUs.☆27Updated 3 months ago
- A coherent Ansible roles collection to simply deploy clusters of nodes.☆116Updated this week
- Bare Metal Provisioning system for HPC Linux clusters☆58Updated this week
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆227Updated this week
- Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images☆120Updated 5 years ago
- ☆14Updated last year
- YAML-based database of datacenter infrastructures☆15Updated 3 months ago
- Ansible role for OpenHPC☆47Updated last week
- Tutorial for installing Open XDMoD, OnDemand, & ColdFront☆121Updated 4 months ago
- Monitoring and visualization of InfiniBand Fabrics☆19Updated 3 years ago
- Ansible role for installing and managing the Slurm Workload Manager☆88Updated 7 months ago
- Prometheus exporter for slurm job/node data☆31Updated 3 months ago
- A daemon that uses cgroups to monitor and manage user behavior on login nodes☆60Updated 3 months ago
- Kerberos credential support for batch environments☆12Updated 3 months ago
- Confluent Cluster Management software☆32Updated this week
- An ansible role for Open Ondemand☆30Updated 7 months ago
- Deploy Kubernetes on OpenStack with RKE2☆48Updated last month
- ☆36Updated 3 weeks ago
- Terraform modules to replicate the HPC user experience in the cloud☆137Updated this week
- Docker local slurm cluster☆55Updated 6 months ago
- server for storage and management of singularity images☆104Updated 4 months ago
- Info on CHPC Open OnDemand installation and customization☆13Updated 6 months ago
- GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.☆42Updated last month
- ☆33Updated last week
- This web portal is intended to give HPC users a view of the overall use of the HPC cluster and their own use.☆28Updated last week