cnvrg / metagpuLinks
K8s device plugin for GPU sharing
☆99Updated 2 years ago
Alternatives and similar repositories for metagpu
Users that are interested in metagpu are comparing it to the libraries listed below
Sorting:
- JobSet: a k8s native API for distributed ML training and HPC workloads☆289Updated last week
- ☆185Updated 2 weeks ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆110Updated last month
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im…☆270Updated this week
- Kubernetes-in-Kubernetes Made Simple☆88Updated 2 years ago
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e…☆142Updated last week
- ☆62Updated last year
- Automatic repair for unhealthy Kubernetes nodes☆60Updated this week
- Manage admission policies in your Kubernetes cluster with ease☆220Updated last week
- Cloud Native Artifacial Intelligence Model Format Specification☆156Updated this week
- ☆30Updated 3 months ago
- Kubernetes Work API☆68Updated 3 weeks ago
- K8s Node Health Check Operator☆131Updated 2 months ago
- Operator for Multi-Cluster Monitoring with Thanos.☆138Updated this week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆86Updated 3 weeks ago
- Smart Kubernetes Scheduling☆81Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆54Updated this week
- WG Serving☆32Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆141Updated this week
- DNS service discovery across connected Kubernetes clusters.☆145Updated this week
- An etcd operator to configure, provision, reconcile and monitor etcd clusters.☆103Updated this week
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co…☆249Updated last week
- KJob: Tool for CLI-loving ML researchers☆40Updated last week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe…☆96Updated this week
- Container Object Storage (COSI) Specification. NOTE: The content of this repo has been moved to https://github.com/kubernetes-sigs/contai…☆62Updated last year
- Checkpoint and Restore in Kubernetes☆160Updated last year
- Kubernetes ClusterInventory API☆81Updated 3 weeks ago
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc.☆94Updated last month
- Holistic job manager on Kubernetes☆115Updated last year
- command line tool to bootstrap open-cluster-management control plane.☆94Updated last month