cnvrg / metagpu
K8s device plugin for GPU sharing
☆99 Updated 2 years ago
Alternatives and similar repositories for metagpu
Users who are interested in metagpu are comparing it to the libraries listed below.
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆262 Updated this week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters. ☆80 Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆92 Updated 2 weeks ago
- Automatic repair for unhealthy Kubernetes nodes ☆53 Updated last month
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im… ☆263 Updated 2 weeks ago
- ☆60 Updated last year
- Smart Kubernetes Scheduling ☆81 Updated this week
- ☆168 Updated 3 weeks ago
- K8s Node Health Check Operator ☆119 Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆129 Updated this week
- Operator for Multi-Cluster Monitoring with Thanos. ☆137 Updated this week
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e… ☆139 Updated this week
- An etcd operator to configure, provision, reconcile and monitor etcd clusters. ☆96 Updated this week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe… ☆87 Updated last week
- New generation community-driven etcd-operator! ☆127 Updated this week
- KJob: Tool for CLI-loving ML researchers ☆39 Updated last week
- Manage admission policies in your Kubernetes cluster with ease ☆215 Updated this week
- Kubernetes-in-Kubernetes Made Simple ☆86 Updated 2 years ago
- The official Kubernetes operator for etcd. ☆91 Updated this week
- CAAPH uses Helm charts to manage the installation and lifecycle of Cluster API add-ons. ☆156 Updated 2 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆69 Updated 2 months ago
- Checkpoint and Restore in Kubernetes ☆151 Updated last year
- ☆26 Updated last month
- Cloud Native Artificial Intelligence Model Format Specification ☆100 Updated this week
- Kubernetes ClusterInventory API ☆74 Updated 3 weeks ago
- Kubernetes Work API ☆68 Updated 3 weeks ago
- Enabling Kubernetes to make pod placement decisions with platform intelligence. ☆176 Updated 8 months ago
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc. ☆94 Updated last week
- Kubernetes Operator to manage node maintenance through NodeMaintenance custom resources ☆44 Updated 2 months ago
- The Sail Operator is able to install and manage the lifecycle of the Istio control plane in a Kubernetes & OpenShift cluster. ☆83 Updated this week