IBM / Bridge-OperatorLinks
Bridge operator repo
☆21Updated last month
Alternatives and similar repositories for Bridge-Operator
Users that are interested in Bridge-Operator are comparing it to the libraries listed below
Sorting:
- llm-d benchmark scripts and tooling☆31Updated this week
- Health checks for Azure N- and H-series VMs.☆54Updated 3 weeks ago
- Cloud Native Benchmarking of Foundation Models☆44Updated 3 months ago
- Home of the HPC Compatible Kubernetes Integration for IBM Spectrum LSF☆43Updated 4 years ago
- A tool to detect infrastructure issues on cloud native AI systems☆49Updated last month
- MIG Partition Editor for NVIDIA GPUs☆222Updated last week
- Holistic job manager on Kubernetes☆116Updated last year
- ☆68Updated last week
- A workload for deploying LLM inference services on Kubernetes☆93Updated this week
- llm-d helm charts and deployment examples☆45Updated last month
- A toolkit for discovering cluster network topology.☆74Updated last week
- A distributed system for Elastic Workload☆31Updated last month
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆26Updated 10 months ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆33Updated 3 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆70Updated 3 months ago
- ☆265Updated 2 weeks ago
- A Slurm cluster for Kubernetes☆65Updated last year
- Distributed KV cache coordinator☆80Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆130Updated last week
- NVIDIA NCCL Tests for Distributed Training☆118Updated last week
- ☆304Updated last week
- GenAI inference performance benchmarking tool☆107Updated this week
- Run Slurm as a Kubernetes scheduler. A Slinky project.☆45Updated this week
- Bitfusion with Kubernetes Integration Support☆50Updated 2 years ago
- Run Slurm on Kubernetes. A Slinky project.☆178Updated last week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆145Updated 2 years ago
- A simulator of Kuberntes for batch and service workload.☆49Updated 4 years ago
- GPU plugin to the node feature discovery for Kubernetes☆306Updated last year
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆497Updated this week
- Testing if I can implement slurm in an operator☆15Updated last year