cnvrg / metagpu
K8s device plugin for GPU sharing
☆98 Updated 2 years ago
Alternatives and similar repositories for metagpu
Users who are interested in metagpu are comparing it to the repositories listed below.
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆241 Updated last week
- ☆159 Updated 3 weeks ago
- Smart Kubernetes Scheduling ☆80 Updated last week
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆77 Updated 2 weeks ago
- Operator for Multi-Cluster Monitoring with Thanos. ☆133 Updated last week
- Kubernetes-in-Kubernetes Made Simple ☆86 Updated 2 years ago
- An etcd operator to configure, provision, reconcile and monitor etcd clusters. ☆90 Updated last week
- Automatic repair for unhealthy Kubernetes nodes ☆53 Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆67 Updated 2 months ago
- KJob: Tool for CLI-loving ML researchers ☆31 Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆115 Updated this week
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e… ☆136 Updated 3 weeks ago
- ☆38 Updated this week
- New generation community-driven etcd-operator! ☆121 Updated this week
- Operator for managing Node Feature Discovery deployment ☆71 Updated last month
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im… ☆256 Updated 3 weeks ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters. ☆76 Updated 2 weeks ago
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe… ☆80 Updated 2 weeks ago
- CAAPH uses Helm charts to manage the installation and lifecycle of Cluster API add-ons. ☆153 Updated last week
- Kubernetes Work API ☆66 Updated 2 months ago
- A Topology-Aware Custom Scheduler For Kubernetes ☆65 Updated 2 years ago
- Manage admission policies in your Kubernetes cluster with ease ☆210 Updated 2 weeks ago
- K8s Node Health Check Operator ☆110 Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence. ☆175 Updated 5 months ago
- Holistic job manager on Kubernetes ☆116 Updated last year
- A collection of community maintained NRI plugins ☆85 Updated last week
- WG Serving ☆27 Updated last month
- ☆52 Updated last year
- The Operator to install and manage the lifecycle of the Kuadrant components deployments. ☆62 Updated this week
- Container Object Storage (COSI) Specification. NOTE: The content of this repo has been moved to https://github.com/kubernetes-sigs/contai… ☆63 Updated 7 months ago