SeldonIO / trtis-k8s-scheduler
Custom Scheduler to deploy ML models to TRTIS for GPU Sharing
☆12Updated 4 years ago
Alternatives and similar repositories for trtis-k8s-scheduler:
Users that are interested in trtis-k8s-scheduler are comparing it to the libraries listed below
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆137Updated 2 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆31Updated last year
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times☆88Updated 2 years ago
- A Kubernetes plugin which enables dynamically adding or removing GPU resources for a running Pod☆122Updated 2 years ago
- elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.☆54Updated 2 years ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆111Updated 6 months ago
- Device plugins for Volcano, e.g. GPU☆113Updated 4 months ago
- Elastic Serverless Serving based on Kubernetes, provides 0 instance serving capability.☆10Updated 3 years ago
- NVIDIA NCCL Tests for Distributed Training☆78Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆59Updated this week
- ☆129Updated 3 years ago
- NVIDIA device plugin for Kubernetes☆15Updated 5 years ago
- A simulator of Kubernetes for batch and service workloads.☆45Updated 3 years ago
- ☆113Updated 2 years ago
- ☆211Updated 2 months ago
- GenAI inference performance benchmarking tool☆12Updated last week
- Automatic tuning for ML model deployment on Kubernetes☆80Updated 2 months ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆25Updated last month
- More Flexible Device Extension Capability in Kubernetes (DevicePlugins++)☆21Updated last year
- Kernel for Kubeflow in Jupyter Notebook☆67Updated 5 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆85Updated 2 weeks ago
- GPU plugin to the node feature discovery for Kubernetes☆296Updated 8 months ago
- Cloud Native Machine Learning Model Registry☆81Updated 2 years ago
- Holistic job manager on Kubernetes☆111Updated 11 months ago
- Kubernetes Rdma SRIOV device plugin☆110Updated 4 years ago
- ☆59Updated 4 months ago
- ☆31Updated 3 years ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆131Updated this week
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆76Updated 10 months ago
- ☆88Updated 2 weeks ago