cnvrg / metagpu
K8s device plugin for GPU sharing
☆98 Updated 2 years ago
Alternatives and similar repositories for metagpu
Users who are interested in metagpu are comparing it to the repositories listed below.
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆296 Updated last week
- ☆187 Updated last month
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆111 Updated this week
- Automatic repair for unhealthy Kubernetes nodes ☆63 Updated 3 weeks ago
- ☆30 Updated 4 months ago
- K8s Node Health Check Operator ☆135 Updated this week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe… ☆97 Updated this week
- ☆62 Updated last year
- Kubernetes-in-Kubernetes Made Simple ☆88 Updated 2 years ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters. ☆87 Updated last month
- Smart Kubernetes Scheduling ☆81 Updated last week
- Cloud Native Artificial Intelligence Model Format Specification ☆166 Updated this week
- Checkpoint and Restore in Kubernetes ☆160 Updated last year
- An etcd operator to configure, provision, reconcile and monitor etcd clusters. ☆104 Updated this week
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im… ☆272 Updated 3 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆73 Updated 5 months ago
- Manage admission policies in your Kubernetes cluster with ease ☆220 Updated last week
- New generation community-driven etcd-operator! ☆134 Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec ☆59 Updated this week
- Operator for Multi-Cluster Monitoring with Thanos. ☆137 Updated this week
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e… ☆143 Updated 3 weeks ago
- The official Kubernetes operator for etcd. ☆111 Updated this week
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc. ☆94 Updated 2 months ago
- DNS service discovery across connected Kubernetes clusters. ☆145 Updated this week
- WG Serving ☆32 Updated 3 weeks ago
- GenAI inference performance benchmarking tool ☆140 Updated 2 weeks ago
- Kubernetes ClusterInventory API ☆82 Updated last month
- Operator for managing Node Feature Discovery deployment ☆74 Updated 5 months ago
- A collection of community maintained NRI plugins ☆100 Updated 3 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆140 Updated 3 weeks ago