NVIDIA device plugin for Kubernetes
☆15Sep 9, 2019Updated 6 years ago
Alternatives and similar repositories for k8s-device-plugin
Users that are interested in k8s-device-plugin are comparing it to the libraries listed below.
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆19May 30, 2025Updated 10 months ago
- GPU analyzer for Kubernetes GPU clusters☆17Apr 11, 2020Updated 6 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 2 years ago
- The scheduler of Volcano, built based on kubernetes-sigs/kube-batch☆14Jul 7, 2019Updated 6 years ago
- ☆31Jun 15, 2021Updated 4 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆76Apr 14, 2026Updated 2 weeks ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Apr 1, 2020Updated 6 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- Fault tolerance for DL frameworks☆71Jul 5, 2023Updated 2 years ago
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 4 years ago
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod☆127Feb 23, 2022Updated 4 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Kernel for Kubeflow in Jupyter Notebook☆65Aug 13, 2019Updated 6 years ago
- Open-source implementation of the CUDA API.☆13May 5, 2012Updated 13 years ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆123Apr 21, 2026Updated last week
- ☆18Nov 13, 2019Updated 6 years ago
- Resource Topology exporter for Topology Aware Scheduler☆15Updated this week
- CNI plugin to override routes☆16Apr 15, 2026Updated 2 weeks ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Device plugins for Volcano, e.g. GPU☆136Mar 20, 2025Updated last year
- Deep learning benchmark utility and optimization tips on EKS.☆47Aug 13, 2019Updated 6 years ago
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 5 months ago
- Master's thesis literature notes (Deep Learning, Scheduling, Distributed, Kubernetes)☆51May 5, 2019Updated 6 years ago
- the hadoop plugin for chdfs☆15Feb 27, 2026Updated 2 months ago
- A Kubernetes operator for mxnet jobs☆52Dec 1, 2021Updated 4 years ago
- CSI driver to bootstrap COSI workloads☆18May 7, 2023Updated 2 years ago
- Cloud Native Machine Learning Model Registry☆81Jan 12, 2023Updated 3 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆532Mar 4, 2024Updated 2 years ago
- Runtime for deep learning workload☆21May 24, 2022Updated 3 years ago
- Linux Traffic Control (TC) based implementation of Kubernetes NPWG MultiNetworkPolicy API☆12Jul 20, 2023Updated 2 years ago
- elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.☆55Jul 27, 2022Updated 3 years ago
- golang nftables library☆35Mar 31, 2026Updated 3 weeks ago
- Ray Framework (https://github.com/ray-project/ray) on Kubernetes☆13Oct 12, 2018Updated 7 years ago
- Cloud Native ML/DL Platform☆132Sep 9, 2020Updated 5 years ago
- ☆14Mar 29, 2022Updated 4 years ago
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- Running and managing Wasm(actors) and capability providers in Kubernetes☆31Dec 12, 2023Updated 2 years ago
- Document templates for a high-performance computing lab☆14Aug 11, 2017Updated 8 years ago