run-ai / k8s-launcherLinks
☆13Updated 2 years ago
Alternatives and similar repositories for k8s-launcher
Users that are interested in k8s-launcher are comparing it to the libraries listed below
Sorting:
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆141Updated this week
- Distributed Model Serving Framework☆180Updated 2 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆119Updated last week
- Holistic job manager on Kubernetes☆115Updated last year
- Controller for ModelMesh☆242Updated 6 months ago
- Helm charts for llm-d☆50Updated 4 months ago
- Cloud Native Benchmarking of Foundation Models☆44Updated 4 months ago
- Helm charts for the KubeRay project☆59Updated 3 weeks ago
- ☆42Updated last week
- ☆185Updated 2 weeks ago
- A tool to detect infrastructure issues on cloud native AI systems☆52Updated 3 months ago
- Module, Model, and Tensor Serialization/Deserialization☆279Updated 4 months ago
- ☆273Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆289Updated this week
- Repository for open inference protocol specification☆61Updated 7 months ago
- K8s device plugin for GPU sharing☆99Updated 2 years ago
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)☆334Updated last week
- NVIDIA NCCL Tests for Distributed Training☆129Updated this week
- GenAI inference performance benchmarking tool☆137Updated this week
- Getting Started with the CoreWeave Kubernetes GPU Cloud☆79Updated 6 months ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆157Updated last week
- llm-d helm charts and deployment examples☆48Updated last week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆30Updated last year
- This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project☆89Updated last year
- markdown docs☆92Updated 2 weeks ago
- Run cloud native workloads on NVIDIA GPUs☆210Updated 2 months ago
- GPU plugin to the node feature discovery for Kubernetes☆308Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 2 months ago
- CUDA checkpoint and restore utility☆397Updated 3 months ago
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆205Updated 2 years ago