elastic-ai / elastic-gpu-exporter
A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container.
☆19Updated 2 years ago
Related projects
Alternatives and complementary repositories for elastic-gpu-exporter
- Device plugin for Volcano vGPU which supports hard resource isolation☆48Updated 2 weeks ago
- The API (CRD) of Volcano☆33Updated last week
- kubernetes-operator is a control plane that manages the lifecycle of all Kubernetes clusters.☆77Updated last year
- Device plugins for Volcano, e.g. GPU☆104Updated 2 months ago
- ☆31Updated 3 years ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆135Updated 2 years ago
- ☆129Updated 3 years ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆17Updated 3 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆52Updated 2 weeks ago
- elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.☆54Updated 2 years ago
- A controller that helps you manipulate arbitrary load balancers☆56Updated last year
- ☆109Updated 2 years ago
- Providing high-performance network for Kubernetes☆110Updated 6 months ago
- Another great app kind for Kubernetes!☆113Updated 2 years ago
- Large-scale Kubernetes cluster diagnostic tool.☆137Updated 10 months ago
- This repo is a sample for Kubernetes scheduler framework.☆93Updated 4 years ago
- A simulator of Kubernetes for batch and service workloads.☆45Updated 3 years ago
- Using CRDs to manage GPU resources in Kubernetes.☆192Updated 2 years ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆105Updated last month
- Kubernetes Initializer example to support LXCFS - https://yq.aliyun.com/articles/566208☆51Updated 5 years ago
- A sample showing how to create a k8s scheduler extender☆56Updated 4 years ago
- An out-of-tree pod autoscaler based on HPA (v2beta2)☆18Updated 2 years ago
- ☆14Updated 2 years ago
- ControllerMesh is a solution that helps developers manage their controllers/operators better with enhanced isolation.☆63Updated last year
- Kubernetes Custom Metrics API and External Metrics API for Alibaba Cloud☆55Updated 5 months ago
- ⎈ Kubernetes cloud-controller-manager for Baidu Cloud.☆39Updated 3 years ago
- ☆28Updated last year
- This repo is a sample for Kubernetes scheduler framework.☆49Updated 3 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆84Updated 3 months ago
- An open cloud native capacity solution which helps you achieve ultimate resource utilization in an intelligent and risk-free way.☆168Updated this week