cnvrg / metagpu
K8s device plugin for GPU sharing
☆100Updated last year
Alternatives and similar repositories for metagpu:
Users that are interested in metagpu are comparing it to the libraries listed below
- JobSet: a k8s native API for distributed ML training and HPC workloads☆194Updated this week
- Kubernetes-in-Kubernetes Made Simple☆86Updated last year
- Example DRA driver that developers can fork and modify to get them started writing their own.☆63Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆63Updated last week
- mck8s: Orchestration platform for multi-cluster k8s environments☆73Updated last year
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e…☆130Updated this week
- Smart Kubernetes Scheduling☆76Updated this week
- K8s Node Health Check Operator☆107Updated last week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆67Updated last week
- Automatic repair for unhealthy Kubernetes nodes☆50Updated last week
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆27Updated 2 weeks ago
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im…☆243Updated last week
- Operator for Multi-Cluster Monitoring with Thanos.☆132Updated this week
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc.☆85Updated last month
- This repo contains sidecar controller and agent for volume health monitoring.☆67Updated this week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆140Updated 2 years ago
- Kubernetes Work API☆65Updated last month
- ☆50Updated last year
- Manage admission policies in your Kubernetes cluster with ease☆208Updated this week
- Kubernetes ClusterInventory API☆64Updated this week
- Contains documentation for projectsveltos☆78Updated this week
- DNS service discovery across connected Kubernetes clusters.☆113Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆99Updated this week
- Dragonfly Helm Charts☆35Updated last week
- An etcd operator to configure, provision, reconcile and monitor etcd clusters.☆85Updated this week
- ☆112Updated last week
- kube-trigger watches events and triggers actions in a programmable way.☆50Updated last year
- The Sail Operator is able to install and manage the lifecycle of the Istio control plane in an Kubernetes & OpenShift cluster.☆56Updated this week
- Operator for managing Node Feature Discovery deployment☆68Updated last week
- Libraries for implementing aggregated apiservers☆87Updated last week