volcano-sh / resource-exporter
Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.
β17Updated 5 months ago
Alternatives and similar repositories for resource-exporter:
Users that are interested in resource-exporter are comparing it to the libraries listed below
- The API (CRD) of Volcanoβ37Updated this week
- Resource Topology exporter for Topology Aware Schedulerβ16Updated this week
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β20Updated 4 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β64Updated 2 weeks ago
- β31Updated 3 years ago
- NVIDIA device plugin for Kubernetesβ15Updated 5 years ago
- β114Updated 2 years ago
- a topology-scheduler and a descheduler extened from descheduler.β15Updated 3 years ago
- π§― Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.β29Updated 3 months ago
- Device plugins for Volcano, e.g. GPUβ118Updated 3 weeks ago
- A simulator of Kuberntes for batch and service workload.β46Updated 4 years ago
- Example DRA driver that developers can fork and modify to get them started writing their own.β69Updated 3 weeks ago
- GPU analyzer for Kubernetes GPU clustersβ17Updated 5 years ago
- A distributed engine for intelligent workloadβ27Updated 2 months ago
- Device-plugin for volcano vgpu which support hard resource isolationβ70Updated 3 weeks ago
- The Volcano Deschedulerβ13Updated 2 months ago
- a sample to showcase how to create a k8s scheduler extenderβ57Updated 4 years ago
- Kubernetes operator for managing the lifecycle of PaddlePaddle job.β24Updated 5 years ago
- β14Updated 3 years ago
- β61Updated this week
- Multi-cluster api gateway based on apiserver-aggregation.β98Updated 3 months ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policiesβ113Updated 8 months ago
- RDMA CNI plugin for containerized workloadsβ52Updated this week
- ControllerMesh is a solution that helps developers manage their controllers/operators better with enhanced isolation.β64Updated last year
- kubernetes device pluginηεΌεη€ΊδΎβ35Updated 5 years ago
- An SRIOV CNI pluginβ68Updated last week
- Load watcher is a cluster-wide aggregator of metrics, developed for Trimaran: Real Load Aware Scheduler in Kubernetes.β71Updated 3 months ago
- A general-purpose GPU monitor, witch can monitor GPU cards and the usage of each pods or containers.β19Updated 3 years ago
- A controller that helps you manipulate arbitrary load balancersβ56Updated 2 years ago
- Holistic job manager on Kubernetesβ114Updated last year