cnvrg / metagpuLinks
K8s device plugin for GPU sharing
☆98Updated 2 years ago
Alternatives and similar repositories for metagpu
Users that are interested in metagpu are comparing it to the libraries listed below
Sorting:
- JobSet: a k8s native API for distributed ML training and HPC workloads☆300Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆114Updated last week
- Automatic repair for unhealthy Kubernetes nodes☆65Updated this week
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im…☆274Updated this week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆88Updated this week
- ☆191Updated last week
- K8s Node Health Check Operator☆135Updated last week
- Smart Kubernetes Scheduling☆81Updated last week
- Kubernetes-in-Kubernetes Made Simple☆88Updated 2 years ago
- Kubernetes Work API☆69Updated 3 weeks ago
- Operator for managing Node Feature Discovery deployment☆73Updated 5 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆142Updated last week
- ☆35Updated 5 months ago
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e…☆144Updated this week
- ☆62Updated last year
- Cloud Native Artifacial Intelligence Model Format Specification☆174Updated this week
- An etcd operator to configure, provision, reconcile and monitor etcd clusters.☆107Updated this week
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe…☆100Updated this week
- Manage admission policies in your Kubernetes cluster with ease☆222Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆60Updated last week
- GenAI inference performance benchmarking tool☆141Updated this week
- DNS service discovery across connected Kubernetes clusters.☆147Updated this week
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co…☆251Updated last month
- High fidelity and scalable capacity and usage metrics for Kubernetes clusters☆132Updated 10 months ago
- WG Serving☆34Updated last month
- command line tool to bootstrap open-cluster-management control plane.☆96Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆74Updated 6 months ago
- Operator for Multi-Cluster Monitoring with Thanos.☆137Updated this week
- CAAPH uses Helm charts to manage the installation and lifecycle of Cluster API add-ons.☆170Updated this week
- Checkpoint and Restore in Kubernetes☆162Updated last year