SlinkyProject / slurm-bridge
Run Slurm as a Kubernetes scheduler. A Slinky project.
☆53 · Updated this week
Alternatives and similar repositories for slurm-bridge
Users who are interested in slurm-bridge compare it to the repositories listed below.
- Run Slurm on Kubernetes. A Slinky project. ☆208 · Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆141 · Updated last week
- A Slurm cluster for Kubernetes ☆66 · Updated last year
- Slurm in Kubernetes ☆43 · Updated last month
- A toolkit for discovering cluster network topology. ☆86 · Updated last week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆119 · Updated last week
- Holistic job manager on Kubernetes ☆115 · Updated last year
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers. ☆147 · Updated this week
- Deploy a Flux MiniCluster to Kubernetes with the operator ☆37 · Updated 3 weeks ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆30 · Updated last year
- A tool to detect infrastructure issues on cloud native AI systems ☆52 · Updated 3 months ago
- ☆273 · Updated 2 weeks ago
- KJob: Tool for CLI-loving ML researchers ☆40 · Updated last week
- MIG Partition Editor for NVIDIA GPUs ☆233 · Updated this week
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces ☆26 · Updated last year
- OCI-compatible engine to deploy Linux containers on HPC environments. ☆141 · Updated last year
- NVIDIA Network Operator ☆306 · Updated last week
- Run Slurm in Kubernetes ☆335 · Updated this week
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers. ☆44 · Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆289 · Updated last week
- ☆39 · Updated this week
- GenAI inference performance benchmarking tool ☆137 · Updated this week
- Helm charts for llm-d ☆50 · Updated 5 months ago
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling ☆131 · Updated last week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe… ☆96 · Updated this week
- ☆185 · Updated 3 weeks ago
- ☆87 · Updated last year
- llm-d helm charts and deployment examples ☆48 · Updated last week
- llm-d benchmark scripts and tooling ☆39 · Updated this week
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster ☆360 · Updated last week