stackhpc / slurm-k8s-cluster
A Slurm cluster for Kubernetes
☆36Updated last month
Related projects: ⓘ
- ☆53Updated last week
- Mellanox Network Operator☆201Updated last week
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆26Updated 3 weeks ago
- NVIDIA k8s device plugin for Kubevirt☆222Updated last month
- RDMA CNI plugin for containerized workloads☆39Updated 2 weeks ago
- The BeeGFS Container Storage Interface (CSI) driver provides high performing and scalable storage for workloads running in Kubernetes. 📦…☆66Updated last month
- ☆202Updated 2 weeks ago
- ☆187Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆227Updated this week
- slurm cluster over k8s☆14Updated 4 years ago
- Device plugins for Volcano, e.g. GPU☆98Updated last week
- GPU plugin to the node feature discovery for Kubernetes☆287Updated 3 months ago
- MIG Partition Editor for NVIDIA GPUs☆163Updated this week
- InfiniBand SR-IOV CNI☆42Updated 2 weeks ago
- Prometheus exporter for a Infiniband Fabric☆52Updated 9 months ago
- Singularity implementation of k8s operator for interacting with SLURM.☆118Updated 3 years ago
- ☆57Updated 3 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆59Updated last month
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆22Updated last week
- Holistic job manager on Kubernetes☆107Updated 7 months ago
- A collection of community maintained NRI plugins☆54Updated this week
- IP Over Infiniband (IPoIB) CNI Plugin☆11Updated 3 months ago
- Bitfusion with Kubernetes Integration Support☆51Updated 10 months ago
- Operator for provisioning and configuring SR-IOV CNI plugin and device plugin☆80Updated this week
- noVNC for kubevirt☆66Updated 6 months ago
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆33Updated this week
- ☆43Updated 3 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆133Updated this week
- ☆45Updated 2 weeks ago
- K8s device plugin for GPU sharing☆93Updated last year