cnvrg / metagpu
K8s device plugin for GPU sharing
☆99Updated last year
Alternatives and similar repositories for metagpu:
Users that are interested in metagpu are comparing it to the libraries listed below
- JobSet: a k8s native API for distributed ML training and HPC workloads☆175Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆177Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆57Updated last week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆65Updated last week
- Kubernetes Work API☆61Updated 2 months ago
- Kubernetes-in-Kubernetes Made Simple☆86Updated last year
- Smart Kubernetes Scheduling☆73Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆59Updated this week
- Automatic repair for unhealthy Kubernetes nodes☆47Updated this week
- ☆48Updated 10 months ago
- K8s Node Health Check Operator☆107Updated 2 weeks ago
- Manage admission policies in your Kubernetes cluster with ease☆201Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆174Updated 7 months ago
- Sidecar container that watches Kubernetes PersistentVolumeClaims objects and triggers controller side expansion operation against a CSI e…☆129Updated this week
- Container Object Storage Interface (COSI) API responsible to define API for COSI objects. NOTE: The content of this repo has been moved t…☆70Updated 2 months ago
- command line tool to bootstrap open-cluster-management control plane.☆83Updated last week
- Operator for Multi-Cluster Monitoring with Thanos.☆127Updated this week
- CAAPH uses Helm charts to manage the installation and lifecycle of Cluster API add-ons.☆128Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆305Updated this week
- Gateway API Inference Extension☆129Updated this week
- mck8s: Orchestration platform for multi-cluster k8s environments☆73Updated last year
- Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container im…☆232Updated last week
- Container Object Storage Interface (COSI) controller responsible to manage lifecycle of COSI objects. NOTE: The content of this repo has …☆95Updated 2 months ago
- Contains documentation for projectsveltos☆70Updated this week
- ☆230Updated 2 months ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆137Updated 2 years ago
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co…☆218Updated this week
- An easier to use and smarter etcd defragmentation tool☆96Updated this week
- ☆52Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆80Updated this week