stackhpc / slurm-k8s-cluster
A Slurm cluster for Kubernetes
☆46Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for slurm-k8s-cluster
- ☆57Updated 2 months ago
- MIG Partition Editor for NVIDIA GPUs☆174Updated this week
- Singularity implementation of k8s operator for interacting with SLURM.☆117Updated 3 years ago
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆27Updated 2 months ago
- Slurm in Kubernetes☆38Updated 2 months ago
- Mellanox Network Operator☆212Updated this week
- ☆198Updated 3 weeks ago
- NVIDIA k8s device plugin for Kubevirt☆232Updated last month
- ☆19Updated 2 months ago
- slurm cluster over k8s☆14Updated 4 years ago
- GPU plugin to the node feature discovery for Kubernetes☆293Updated 5 months ago
- The BeeGFS Container Storage Interface (CSI) driver provides high performing and scalable storage for workloads running in Kubernetes. 📦…☆65Updated 2 weeks ago
- ☆213Updated last week
- RDMA CNI plugin for containerized workloads☆41Updated 2 months ago
- Device plugins for Volcano, e.g. GPU☆105Updated 2 months ago
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆22Updated last month
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆271Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆440Updated last month
- A Slurm-based HPC workload management environment, driven by Ansible.☆51Updated this week
- Prometheus exporter for a Infiniband Fabric☆54Updated 11 months ago
- ☆51Updated 2 months ago
- Holistic job manager on Kubernetes☆108Updated 9 months ago
- noVNC for kubevirt☆67Updated 8 months ago
- NVIDIA NCCL Tests for Distributed Training☆70Updated 2 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆150Updated this week
- Prometheus exporter for performance metrics from Slurm.☆236Updated 5 months ago
- IP Over Infiniband (IPoIB) CNI Plugin☆10Updated 5 months ago
- ☆64Updated this week
- Operator for provisioning and configuring SR-IOV CNI plugin and device plugin☆84Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆170Updated 5 months ago