cnvrg / metagpu
K8s device plugin for GPU sharing
☆99Updated last year
Alternatives and similar repositories for metagpu:
Users that are interested in metagpu are comparing it to the libraries listed below
- JobSet: a k8s native API for distributed ML training and HPC workloads☆187Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆257Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆61Updated 3 weeks ago
- Kubernetes-in-Kubernetes Made Simple☆86Updated last year
- Automatic repair for unhealthy Kubernetes nodes☆50Updated 2 weeks ago
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e…☆130Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆57Updated this week
- K8s Node Health Check Operator☆107Updated last month
- Gateway API Inference Extension☆150Updated this week
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆27Updated last year
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co…☆222Updated 3 weeks ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆65Updated 2 weeks ago
- ☆52Updated this week
- Smart Kubernetes Scheduling☆73Updated this week
- Operator for Multi-Cluster Monitoring with Thanos.☆130Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆174Updated 3 weeks ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆138Updated 2 years ago
- ☆37Updated this week
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im…☆234Updated this week
- Kubernetes Work API☆61Updated this week
- Kubernetes ClusterInventory API☆62Updated this week
- Kubernetes in Kubernetes☆201Updated this week
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆95Updated this week
- Holistic job manager on Kubernetes☆111Updated 11 months ago
- command line tool to bootstrap open-cluster-management control plane.☆84Updated 3 weeks ago
- An etcd operator to configure, provision, reconcile and monitor etcd clusters.☆83Updated last week
- Node Resource Interface☆279Updated 2 weeks ago
- Container Object Storage Interface (COSI) API responsible to define API for COSI objects. NOTE: The content of this repo has been moved t…☆70Updated 2 months ago
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc.☆84Updated this week
- elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.☆54Updated 2 years ago