stackhpc / slurm-k8s-cluster
A Slurm cluster for Kubernetes
☆65 · Updated last year
Alternatives and similar repositories for slurm-k8s-cluster
Users who are interested in slurm-k8s-cluster compare it to the repositories listed below.
- Run Slurm on Kubernetes. A Slinky project. ☆182 · Updated this week
- Run Slurm in Kubernetes ☆311 · Updated this week
- ☆267 · Updated 3 weeks ago
- NVIDIA Network Operator ☆289 · Updated this week
- MIG Partition Editor for NVIDIA GPUs ☆224 · Updated this week
- GPU plugin to the node feature discovery for Kubernetes ☆306 · Updated last year
- NVIDIA k8s device plugin for Kubevirt ☆267 · Updated this week
- ☆68 · Updated last week
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers. ☆38 · Updated 2 weeks ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆497 · Updated this week
- ☆28 · Updated last week
- ☆159 · Updated 2 weeks ago
- NVIDIA DRA Driver for GPUs ☆477 · Updated this week
- Run Slurm as a Kubernetes scheduler. A Slinky project. ☆48 · Updated this week
- Singularity implementation of k8s operator for interacting with SLURM. ☆117 · Updated 4 years ago
- IP Over Infiniband (IPoIB) CNI Plugin ☆16 · Updated last week
- Run cloud native workloads on NVIDIA GPUs ☆204 · Updated last month
- ☆305 · Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆110 · Updated 3 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆131 · Updated this week
- Slurm in Kubernetes ☆43 · Updated last month
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆276 · Updated last week
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers. ☆138 · Updated 2 weeks ago
- Holistic job manager on Kubernetes ☆116 · Updated last year
- RDMA CNI plugin for containerized workloads ☆58 · Updated last week
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster ☆354 · Updated 2 weeks ago
- Prometheus exporter for an InfiniBand fabric ☆68 · Updated last year
- A toolkit for discovering cluster network topology. ☆76 · Updated last week
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times ☆89 · Updated 3 years ago
- NVIDIA NCCL Tests for Distributed Training ☆121 · Updated last week