SlinkyProject / slurm-operator
Run Slurm on Kubernetes. A Slinky project.
☆88Updated 2 weeks ago
Alternatives and similar repositories for slurm-operator
Users that are interested in slurm-operator are comparing it to the libraries listed below
Sorting:
- Slurm in Kubernetes☆41Updated 5 months ago
- A Slurm cluster for Kubernetes☆57Updated 9 months ago
- ☆24Updated last week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe…☆69Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆226Updated last week
- Run Slurm in Kubernetes☆221Updated this week
- ☆110Updated last week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆355Updated this week
- ☆248Updated last week
- K8s device plugin for GPU sharing☆100Updated 2 years ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆96Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆103Updated this week
- ☆62Updated this week
- KJob: Tool for CLI-loving ML researchers☆28Updated last week
- GenAI inference performance benchmarking tool☆41Updated this week
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆34Updated last week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆176Updated 3 months ago
- Deploy a Flux MiniCluster to Kubernetes with the operator☆32Updated this week
- ☆150Updated last month
- Testing if I can implement slurm in an operator☆14Updated 6 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆66Updated last week
- NVIDIA k8s device plugin for Kubevirt☆252Updated last month
- NVIDIA Network Operator☆248Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆70Updated this week
- ☆85Updated 8 months ago
- A toolkit for discovering cluster network topology.☆46Updated 2 weeks ago
- Holistic job manager on Kubernetes☆115Updated last year
- ☆44Updated this week
- Singularity implementation of k8s operator for interacting with SLURM.☆117Updated 4 years ago
- MIG Partition Editor for NVIDIA GPUs☆198Updated last week