stackhpc / slurm-k8s-clusterLinks
A Slurm cluster for Kubernetes
☆62Updated last year
Alternatives and similar repositories for slurm-k8s-cluster
Users that are interested in slurm-k8s-cluster are comparing it to the libraries listed below
Sorting:
- Run Slurm on Kubernetes. A Slinky project.☆138Updated 2 weeks ago
- MIG Partition Editor for NVIDIA GPUs☆204Updated last week
- NVIDIA Network Operator☆268Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆302Updated last year
- ☆254Updated last month
- NVIDIA k8s device plugin for Kubevirt☆256Updated 3 weeks ago
- ☆64Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆487Updated last week
- NVIDIA DRA Driver for GPUs☆400Updated last week
- ☆26Updated last month
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆36Updated last week
- Run Slurm in Kubernetes☆258Updated this week
- ☆283Updated last week
- ☆130Updated 2 weeks ago
- Singularity implementation of k8s operator for interacting with SLURM.☆117Updated 4 years ago
- NVIDIA NCCL Tests for Distributed Training☆100Updated last week
- An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.☆261Updated this week
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆121Updated last week
- Slurm in Kubernetes☆43Updated 7 months ago
- RDMA CNI plugin for containerized workloads☆55Updated this week
- Run cloud native workloads on NVIDIA GPUs☆188Updated this week
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆26Updated 7 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆119Updated this week
- Prometheus exporter for a Infiniband Fabric☆65Updated last year
- JobSet: a k8s native API for distributed ML training and HPC workloads☆246Updated this week
- Holistic job manager on Kubernetes☆117Updated last year
- Device plugins for Volcano, e.g. GPU☆126Updated 4 months ago
- Golang bindings for Nvidia Datacenter GPU Manager (DCGM)☆123Updated this week
- Bitfusion with Kubernetes Integration Support☆50Updated last year
- K8s device plugin for GPU sharing☆98Updated 2 years ago