elastic-gpu-agent is a Kubernetes device plugin for allocating GPU resources on a node.
☆55 · Jul 27, 2022 · Updated 3 years ago
Alternatives and similar repositories for elastic-gpu-agent
Users that are interested in elastic-gpu-agent are comparing it to the libraries listed below
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resource scheduling. ☆145 · Nov 21, 2022 · Updated 3 years ago
- A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container. ☆21 · Mar 8, 2022 · Updated 3 years ago
- Using CRDs to manage GPU resources in Kubernetes. ☆209 · Nov 21, 2022 · Updated 3 years ago
- ☆539 · Jun 7, 2024 · Updated last year
- ☆892 · Apr 2, 2024 · Updated last year
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod ☆127 · Feb 23, 2022 · Updated 4 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ☆11 · Apr 1, 2020 · Updated 5 years ago
- An operator for managing Alluxio system on Kubernetes cluster ☆13 · Jan 9, 2024 · Updated 2 years ago
- GPU plugin to the node feature discovery for Kubernetes ☆307 · May 27, 2024 · Updated last year
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies ☆121 · Dec 8, 2025 · Updated 2 months ago
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆204 · Nov 22, 2023 · Updated 2 years ago
- egg - the simple error eggregator ☆14 · Feb 25, 2021 · Updated 5 years ago
- NVIDIA k8s device plugin for Kubevirt ☆278 · Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster ☆1,528 · Dec 29, 2023 · Updated 2 years ago
- ☆17 · Apr 22, 2022 · Updated 3 years ago
- A crowdsourced Azure RBAC permissions reference. ☆17 · Jul 9, 2025 · Updated 7 months ago
- Kata Containers KSM throttling daemon ☆25 · May 5, 2021 · Updated 4 years ago
- NVIDIA device plugin for Kubernetes ☆15 · Sep 9, 2019 · Updated 6 years ago
- ☆43 · May 16, 2024 · Updated last year
- Kubernetes operator to manage aws-auth ConfigMap for AWS EKS ☆18 · Sep 23, 2023 · Updated 2 years ago
- Components and utilities which extend the Mayastor core control & data plane functionality ☆25 · Updated this week
- Jenkins swarm slaves with Docker installed ☆19 · Dec 31, 2021 · Updated 4 years ago
- SCV is a distributed cluster GPU sniffer. ☆20 · Feb 25, 2023 · Updated 3 years ago
- A controller that works with native kubernetes hpas to run pod autoscaling ☆19 · Aug 23, 2018 · Updated 7 years ago
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster ☆374 · Jan 20, 2026 · Updated last month
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆531 · Mar 4, 2024 · Updated 2 years ago
- More Flexible Device Extension Capability in Kubernetes (DevicePlugins++) ☆25 · Jun 12, 2023 · Updated 2 years ago
- [EOL] CSIDriver CRD object ☆23 · Jul 27, 2020 · Updated 5 years ago
- ☆132 · Apr 19, 2021 · Updated 4 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training ☆91 · Jan 10, 2025 · Updated last year
- Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs ☆358 · Jul 7, 2025 · Updated 7 months ago
- GPU-scheduler-for-deep-learning ☆210 · Nov 5, 2020 · Updated 5 years ago
- Yoda is a Kubernetes scheduler based on GPU metrics. ☆137 · Mar 27, 2022 · Updated 3 years ago
- RDMA CNI plugin for containerized workloads ☆59 · Feb 15, 2026 · Updated 2 weeks ago
- Scripts for k8s scalability testing and analysis ☆23 · Jan 4, 2018 · Updated 8 years ago
- NVIDIA Network Operator ☆325 · Updated this week
- CLI tool for providing AWS credentials to a container from the host ☆25 · Aug 14, 2023 · Updated 2 years ago
- Part of a PoC of Pod migration in Kubernetes ☆26 · Dec 21, 2020 · Updated 5 years ago
- pip install picka - Picka is a python based data generation and randomization module which aims to increase coverage by increasing the am… ☆112 · Sep 3, 2019 · Updated 6 years ago