AliyunContainerService / gpushare-scheduler-extender
GPU Sharing Scheduler for Kubernetes Cluster
☆1,495 · Updated last year
Alternatives and similar repositories for gpushare-scheduler-extender
Users who are interested in gpushare-scheduler-extender are comparing it to the libraries listed below.
- GPU Sharing Device Plugin for Kubernetes Cluster ☆487 · Updated 2 years ago
- ☆883 · Updated last year
- OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app… ☆573 · Updated last year
- A CLI for Kubeflow. ☆795 · Updated last week
- ☆535 · Updated last year
- NVIDIA device plugin for Kubernetes ☆3,427 · Updated last week
- Heterogeneous AI Computing Virtualization Middleware (Project under CNCF) ☆2,122 · Updated this week
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆530 · Updated last year
- Repository for out-of-tree scheduler plugins based on scheduler framework. ☆1,231 · Updated last week
- A batch scheduler of Kubernetes for high-performance workloads, e.g. AI/ML, BigData, HPC ☆1,093 · Updated 2 years ago
- ☆132 · Updated 4 years ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container ☆210 · Updated 2 weeks ago
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, … ☆1,583 · Updated last week
- GPU plugin to the node feature discovery for Kubernetes ☆305 · Updated last year
- Using CRDs to manage GPU resources in Kubernetes. ☆209 · Updated 2 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆494 · Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes ☆2,289 · Updated this week
- Device plugins for Volcano, e.g. GPU ☆128 · Updated 5 months ago
- Distributed AI Model Training and Fine-Tuning on Kubernetes ☆1,909 · Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM ☆1,382 · Updated last week
- Kubeflow helm chart ☆145 · Updated 2 years ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆572 · Updated last week
- Kubeflow Deployment Manifests ☆947 · Updated this week
- ☆52 · Updated 2 months ago
- An edge-native container management system for edge computing ☆1,061 · Updated last year
- NVIDIA DRA Driver for GPUs ☆441 · Updated this week
- Share GPU between Pods in Kubernetes ☆212 · Updated 2 years ago
- ☆295 · Updated last week
- Tools for monitoring NVIDIA GPUs on Linux ☆1,051 · Updated 3 years ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resource scheduling. ☆145 · Updated 2 years ago