gpucloud/k8s-device-plugin

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gpucloud/k8s-device-plugin)

gpucloud / k8s-device-plugin

NVIDIA device plugin for Kubernetes

☆15

Alternatives and similar repositories for k8s-device-plugin

Users that are interested in k8s-device-plugin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

volcano-sh / resource-exporter
View on GitHub
Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.
☆19Updated this week
volcano-sh / community
View on GitHub
Volcano community content
☆13Updated this week
PaddleFlow / paddle-operator
View on GitHub
Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano
☆32May 19, 2023Updated 3 years ago
AliyunContainerService / gpu-analyzer
View on GitHub
GPU analyzer for Kubernetes GPU clusters
☆16Apr 11, 2020Updated 6 years ago
volcano-retired / scheduler
View on GitHub
The scheduler of Volcano, built based on kubernetes-sigs/kube-batch
☆14Jul 7, 2019Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
tkestack / go-nvml
View on GitHub
☆30Jun 15, 2021Updated 5 years ago
AliyunContainerService / et-operator
View on GitHub
Kubernetes Operator for AI and Bigdata Elastic Training
☆91Jan 10, 2025Updated last year
anuvu / squashfs
View on GitHub
golang library for accessing squashfs filesystems that utilizes squashfs-tools-ng
☆10Sep 12, 2023Updated 2 years ago
NVIDIA / knavigator
View on GitHub
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆79Jul 6, 2026Updated 2 weeks ago
SeldonIO / trtis-k8s-scheduler
View on GitHub
Custom Scheduler to deploy ML models to TRTIS for GPU Sharing
☆12Apr 1, 2020Updated 6 years ago
intel / nodus
View on GitHub
Simulated large clusters for Kubernetes scheduler validation.
☆15Jan 3, 2023Updated 3 years ago
kleveross / ftlib
View on GitHub
Fault-tolerant for DL frameworks
☆71Jul 5, 2023Updated 3 years ago
thecooltechguy / mlbot
View on GitHub
A fast & easy way to train ML models in your cloud, directly from your laptop.
☆14Mar 28, 2022Updated 4 years ago
pokerfaceSad / GPUMounter
View on GitHub
A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod
☆127Feb 23, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jqlu / ackctl
View on GitHub
☆10Jul 29, 2020Updated 5 years ago
dankamongmen / libcudest
View on GitHub
Open-source implementation of the CUDA API.
☆13May 5, 2012Updated 14 years ago
AnonymousMetaLearn / Towards-benchmarking-and-dissecting-one-shot-neural-architecture-search
View on GitHub
☆18Nov 13, 2019Updated 6 years ago
NVIDIA / go-gpuallocator
View on GitHub
Go Abstraction for Allocating NVIDIA GPUs with Custom Policies
☆123Updated this week
k8stopologyawareschedwg / resource-topology-exporter
View on GitHub
Resource Topology exporter for Topology Aware Scheduler
☆15Jun 30, 2026Updated 2 weeks ago
d-run / drun-docs
View on GitHub
d.run website
☆17Jul 3, 2026Updated 2 weeks ago
openshift / route-override-cni
View on GitHub
CNI plugin to override routes
☆16May 23, 2026Updated last month
copilot-io / runtime-copilot
View on GitHub
The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…
☆13May 16, 2023Updated 3 years ago
volcano-sh / devices
View on GitHub
Device plugins for Volcano, e.g. GPU
☆137Mar 20, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aws-samples / aws-eks-deep-learning-benchmark
View on GitHub
Deep learning benchmark utility and optimization tips on EKS.
☆47Aug 13, 2019Updated 6 years ago
kubean-io / kube-node-tuning
View on GitHub
Manage kubernetes node-level kernel tuning ( using sysctl ).
☆30Nov 21, 2025Updated 7 months ago
tencentyun / chdfs-hadoop-plugin
View on GitHub
the hadoop plugin for chdfs
☆15Feb 27, 2026Updated 4 months ago
yylin1 / papers-notebook-with-scheduling
View on GitHub
碩士論文文獻筆記（Deep Learning、Scheduling、Distributed、Kubernetes）
☆51May 5, 2019Updated 7 years ago
kubeflow / mxnet-operator
View on GitHub
A Kubernetes operator for mxnet jobs
☆52Dec 1, 2021Updated 4 years ago
kubedl-io / kubedl
View on GitHub
Run your deep learning workloads on Kubernetes more easily and efficiently.
☆532Mar 4, 2024Updated 2 years ago
kleveross / klever-model-registry
View on GitHub
Cloud Native Machine Learning Model Registry
☆81Jan 12, 2023Updated 3 years ago
microsoft / openpai-runtime
View on GitHub
Runtime for deep learning workload
☆21May 24, 2022Updated 4 years ago
kubernetes-sigs / knftables
View on GitHub
golang nftables library
☆38May 20, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
elastic-ai / elastic-gpu-agent
View on GitHub
elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.
☆54Jul 27, 2022Updated 3 years ago
k8snetworkplumbingwg / multi-networkpolicy-tc
View on GitHub
Linux Traffic Control (TC) based implementation of Kubernetes NPWG MultiNetworkPolicy API
☆12Jul 20, 2023Updated 3 years ago
tensorchord / ai-infra-statistics
View on GitHub
This repository contains statistics about the AI Infrastructure products.
☆16Feb 27, 2025Updated last year
tkestack / csi-operator
View on GitHub
☆14Mar 29, 2022Updated 4 years ago
cioc / ray-kubernetes
View on GitHub
Ray Framework (https://github.com/ray-project/ray) on Kubernetes
☆13Oct 12, 2018Updated 7 years ago
wasmCloud / kasmcloud
View on GitHub
Running and managing Wasm(actors) and capability providers in Kubernetes
☆32Dec 12, 2023Updated 2 years ago
Qihoo360 / dgl-operator
View on GitHub
The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes
☆44Sep 15, 2021Updated 4 years ago