run-ai / k8s-launcher
☆13 · Updated 2 years ago
Alternatives and similar repositories for k8s-launcher
Users who are interested in k8s-launcher are comparing it to the libraries listed below.
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆140 · Updated 3 weeks ago
- Distributed Model Serving Framework ☆182 · Updated 3 months ago
- Helm charts for llm-d ☆50 · Updated 5 months ago
- Module, Model, and Tensor Serialization/Deserialization ☆283 · Updated 4 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆296 · Updated last week
- Open Model Engine (OME): Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T… ☆355 · Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆118 · Updated this week
- ☆275 · Updated this week
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling ☆138 · Updated this week
- Repository for open inference protocol specification ☆61 · Updated 8 months ago
- ☆43 · Updated this week
- Helm charts for the KubeRay project ☆59 · Updated last month
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers. ☆150 · Updated this week
- Holistic job manager on Kubernetes ☆115 · Updated last year
- A toolkit for discovering cluster network topology. ☆89 · Updated last month
- Run Slurm on Kubernetes. A Slinky project. ☆213 · Updated last week
- ☆187 · Updated last month
- Getting Started with the CoreWeave Kubernetes GPU Cloud ☆79 · Updated 6 months ago
- KJob: Tool for CLI-loving ML researchers ☆40 · Updated 2 weeks ago
- Controller for ModelMesh ☆242 · Updated 7 months ago
- GenAI inference performance benchmarking tool ☆140 · Updated 2 weeks ago
- NVIDIA NCCL Tests for Distributed Training ☆132 · Updated this week
- K8s device plugin for GPU sharing ☆98 · Updated 2 years ago
- CUDA checkpoint and restore utility ☆401 · Updated 3 months ago
- This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project ☆89 · Updated last year
- Run Slurm as a Kubernetes scheduler. A Slinky project. ☆55 · Updated 3 weeks ago
- GPU plugin to the node feature discovery for Kubernetes ☆308 · Updated last year
- Gateway API Inference Extension ☆559 · Updated this week
- Cloud Native Benchmarking of Foundation Models ☆44 · Updated 5 months ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆162 · Updated this week