ashrafgt / k8s-gpu-hpaLinks

Horizontal Pod Autoscaling for Kubernetes using Nvidia GPU Metrics

☆32

Alternatives and similar repositories for k8s-gpu-hpa

Users that are interested in k8s-gpu-hpa are comparing it to the libraries listed below

Sorting:

coreweave / kubernetes-cloud
Getting Started with the CoreWeave Kubernetes GPU Cloud
☆73Updated 2 weeks ago
elastic-ai / elastic-gpu-scheduler
elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.
☆141Updated 2 years ago
nebuly-ai / k8s-device-plugin
NVIDIA device plugin for Kubernetes
☆48Updated last year
kserve / modelmesh
Distributed Model Serving Framework
☆173Updated 3 weeks ago
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆205Updated 2 months ago
coreweave / tensorizer
Module, Model, and Tensor Serialization/Deserialization
☆241Updated 2 weeks ago
volcano-sh / devices
Device plugins for Volcano, e.g. GPU
☆124Updated 3 months ago
kserve / modelmesh-serving
Controller for ModelMesh
☆232Updated 2 weeks ago
NVIDIA / k8s-dra-driver-gpu
Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
☆377Updated last week
elastic-ai / elastic-gpu-agent
elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.
☆54Updated 2 years ago
GoogleCloudPlatform / container-engine-accelerators
Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine
☆235Updated this week
NVIDIA / ais-k8s
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
☆98Updated last week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆237Updated this week
NVIDIA / gpu-feature-discovery
GPU plugin to the node feature discovery for Kubernetes
☆300Updated last year
Deepomatic / shared-gpu-nvidia-k8s-device-plugin
Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times
☆88Updated 3 years ago
awslabs / aws-virtual-gpu-device-plugin
AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads
☆205Updated last year
run-ai / runai-model-streamer
☆222Updated this week
kserve / website
User documentation for KServe.
☆106Updated this week
run-ai / fake-gpu-operator
☆114Updated 3 weeks ago
kserve / modelmesh-performance
ModelMesh Performance Scripts, Dashboard and Pipelines
☆11Updated last month
AliyunContainerService / gpushare-device-plugin
GPU Sharing Device Plugin for Kubernetes Cluster
☆485Updated 2 years ago
kserve / modelmesh-runtime-adapter
Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods
☆21Updated 2 weeks ago
coreweave / ml-containers
☆37Updated this week
triton-inference-server / common
Common source, scripts and utilities shared across all Triton repositories.
☆74Updated last week
tkestack / gpu-admission
☆133Updated 4 years ago
YH-Wu / Triton-Inference-Server-on-Kubernetes
☆31Updated 2 years ago
InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆204Updated this week
ehfd / nvidia-dind
Isolated DinD (Docker in Docker) container for developing and deploying Docker containers using NVIDIA GPUs and the NVIDIA container tool…
☆41Updated 10 months ago
kube-queue / kube-queue
☆117Updated 2 years ago
grgalex / nvshare
Practical GPU Sharing Without Memory Size Constraints
☆271Updated 2 months ago