cnvrg / metagpuLinks
K8s device plugin for GPU sharing
☆98Updated 2 years ago
Alternatives and similar repositories for metagpu
Users that are interested in metagpu are comparing it to the libraries listed below
Sorting:
- JobSet: a k8s native API for distributed ML training and HPC workloads☆236Updated last week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆74Updated last month
- Smart Kubernetes Scheduling☆79Updated this week
- Automatic repair for unhealthy Kubernetes nodes☆51Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆67Updated last month
- Kubernetes-in-Kubernetes Made Simple☆86Updated 2 years ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆75Updated 3 weeks ago
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im…☆255Updated 3 weeks ago
- ☆157Updated last week
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e…☆134Updated this week
- A collection of community maintained NRI plugins☆82Updated this week
- An etcd operator to configure, provision, reconcile and monitor etcd clusters.☆89Updated this week
- ☆52Updated last year
- WG Serving☆27Updated last week
- Plugin to Velero which automates backing up and restoring KubeVirt/CDI objects☆38Updated this week
- The official Kubernetes operator for etcd.☆78Updated this week
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc.☆89Updated 2 months ago
- New generation community-driven etcd-operator!☆120Updated this week
- Manage admission policies in your Kubernetes cluster with ease☆209Updated this week
- Dragonfly Helm Charts☆37Updated this week
- K8s Node Health Check Operator☆107Updated last week
- Generates Kubernetes CRD API reference documentation☆133Updated last week
- Operator for Multi-Cluster Monitoring with Thanos.☆132Updated this week
- Checkpoint and Restore in Kubernetes☆142Updated last year
- This repo contains sidecar controller and agent for volume health monitoring.☆68Updated last week
- Libraries for implementing aggregated apiservers☆90Updated 2 months ago
- Kubernetes ClusterInventory API☆71Updated 3 months ago
- Container Object Storage (COSI) Specification. NOTE: The content of this repo has been moved to https://github.com/kubernetes-sigs/contai…☆64Updated 6 months ago
- KJob: Tool for CLI-loving ML researchers☆30Updated last week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe…☆79Updated this week