NVIDIA / k8s-dra-driver-gpuLinks

NVIDIA DRA Driver for GPUs

☆400

Alternatives and similar repositories for k8s-dra-driver-gpu

Users that are interested in k8s-dra-driver-gpu are comparing it to the libraries listed below

Sorting:

kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆526Updated this week
NVIDIA / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆707Updated last week
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆415Updated this week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆246Updated this week
NVIDIA / gpu-feature-discovery
GPU plugin to the node feature discovery for Kubernetes
☆302Updated last year
Mellanox / k8s-rdma-shared-dev-plugin
☆283Updated last week
NVIDIA / kubevirt-gpu-device-plugin
NVIDIA k8s device plugin for Kubevirt
☆256Updated 3 weeks ago
volcano-sh / devices
Device plugins for Volcano, e.g. GPU
☆126Updated 4 months ago
elastic-ai / elastic-gpu-scheduler
elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.
☆142Updated 2 years ago
volcano-sh / volcano-global
A federation scheduler for multi-cluster
☆48Updated last month
Mellanox / network-operator
NVIDIA Network Operator
☆268Updated this week
Project-HAMi / HAMi-core
HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container
☆191Updated last week
kubeflow / mpi-operator
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
☆487Updated last week
run-ai / fake-gpu-operator
☆130Updated 2 weeks ago
kubernetes-sigs / dra-example-driver
Example DRA driver that developers can fork and modify to get them started writing their own.
☆86Updated 2 weeks ago
InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆227Updated last week
NVIDIA / mig-parted
MIG Partition Editor for NVIDIA GPUs
☆204Updated last week
ROCm / k8s-device-plugin
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
☆334Updated last week
NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆119Updated this week
kube-queue / kube-queue
☆118Updated 2 years ago
kserve / modelmesh-serving
Controller for ModelMesh
☆239Updated last month
kubernetes-sigs / inference-perf
GenAI inference performance benchmarking tool
☆71Updated this week
project-codeflare / multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
☆117Updated last year
awslabs / aws-virtual-gpu-device-plugin
AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads
☆205Updated last year
kubeflow / model-registry
Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…
☆138Updated last week
grgalex / nvshare
Practical GPU Sharing Without Memory Size Constraints
☆276Updated 4 months ago
cncf-tags / container-device-interface
☆254Updated last month
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆69Updated 2 weeks ago
NTHU-LSALAB / KubeShare
Share GPU between Pods in Kubernetes
☆211Updated 2 years ago
kubernetes-sigs / node-feature-discovery
Node feature discovery for Kubernetes
☆920Updated this week