IBM / Bridge-OperatorLinks
Bridge operator repo
☆21Updated 4 months ago
Alternatives and similar repositories for Bridge-Operator
Users that are interested in Bridge-Operator are comparing it to the libraries listed below
Sorting:
- Home of the HPC Compatible Kubernetes Integration for IBM Spectrum LSF☆44Updated 5 years ago
- llm-d benchmark scripts and tooling☆44Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆52Updated 4 months ago
- Health checks for Azure N- and H-series VMs.☆57Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆146Updated this week
- ☆71Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆74Updated 6 months ago
- Run Slurm as a Kubernetes scheduler. A Slinky project.☆61Updated last week
- A Slurm cluster for Kubernetes☆68Updated last year
- NVIDIA NCCL Tests for Distributed Training☆134Updated last week
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆121Updated last month
- A workload for deploying LLM inference services on Kubernetes☆167Updated last week
- Cloud Native Benchmarking of Foundation Models☆45Updated 6 months ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆510Updated 2 weeks ago
- Holistic job manager on Kubernetes☆116Updated last year
- MIG Partition Editor for NVIDIA GPUs☆240Updated this week
- Run Slurm on Kubernetes. A Slinky project.☆230Updated this week
- ☆334Updated this week
- NVIDIA Networking NIC Configuration Operator For Kubernetes☆14Updated this week
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆35Updated this week
- A toolkit for discovering cluster network topology.☆96Updated last week
- Kubernetes Rdma SRIOV device plugin☆114Updated 5 years ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆145Updated 3 years ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆19Updated 8 months ago
- Distributed KV cache scheduling & offloading libraries☆101Updated this week
- Bitfusion with Kubernetes Integration Support☆50Updated 2 years ago
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆26Updated 3 weeks ago
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated last week
- ☆282Updated 2 weeks ago
- A federation scheduler for multi-cluster☆61Updated last week