4paradigm/k8s-vgpu-scheduler

4paradigm / k8s-vgpu-scheduler

The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access a memory space larger than the device's physical capacity. It is designed to make extended device memory easy to use for AI workloads.

☆594

Alternatives and similar repositories for k8s-vgpu-scheduler

Users that are interested in k8s-vgpu-scheduler are comparing it to the libraries listed below.

Project-HAMi / HAMi
Heterogeneous GPU Sharing on Kubernetes
☆3,839 · Updated this week
tkestack / gpu-manager
☆904 · Apr 2, 2024 · Updated 2 years ago
tkestack / vcuda-controller
☆544 · Jun 7, 2024 · Updated 2 years ago
AliyunContainerService / gpushare-scheduler-extender
GPU Sharing Scheduler for Kubernetes Cluster
☆1,533 · Dec 29, 2023 · Updated 2 years ago
Project-HAMi / HAMi-core
HAMi-core compiles libvgpu.so, which ensures a hard limit on GPU resources in the container
☆317 · Updated this week
elastic-ai / elastic-gpu
Using CRDs to manage GPU resources in Kubernetes.
☆215 · Nov 21, 2022 · Updated 3 years ago
4paradigm / openaios-platform
OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development…
☆100 · Aug 20, 2021 · Updated 4 years ago
AliyunContainerService / gpushare-device-plugin
GPU Sharing Device Plugin for Kubernetes Cluster
☆496 · Jan 10, 2023 · Updated 3 years ago
NVIDIA / k8s-device-plugin
NVIDIA device plugin for Kubernetes
☆3,812 · Updated this week
4paradigm / pafka
Pafka originated from the OpenAIOS project and leverages an optimized tiered storage access strategy to improve overall performance for …
☆67 · Jan 2, 2022 · Updated 4 years ago
elastic-ai / elastic-gpu-scheduler
elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resource scheduling.
☆147 · Nov 21, 2022 · Updated 3 years ago
tkestack / gpu-admission
☆131 · Apr 19, 2021 · Updated 5 years ago
pokerfaceSad / GPUMounter
A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod
☆127 · Feb 23, 2022 · Updated 4 years ago
4paradigm / OpenEmbedding
OpenEmbedding is an open source framework for TensorFlow distributed training acceleration.
☆33 · Apr 13, 2023 · Updated 3 years ago
Mellanox / k8s-rdma-shared-dev-plugin
☆374 · Updated this week
virtaitech / orion
☆278 · Jul 6, 2023 · Updated 3 years ago
volcano-sh / volcano
A Cloud Native Batch System (Project under CNCF)
☆5,772 · Updated this week
NVIDIA / gpu-operator
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
☆2,783 · Updated this week
NTHU-LSALAB / KubeShare
Share GPU between Pods in Kubernetes
☆217 · Feb 6, 2023 · Updated 3 years ago
tencentmusic / cube-studio
cube studio is an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform: full MLOps algorithm workflow, compute rental platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, vGPU virtualization, edge computing, annotation platform with automated labeling, deepseek…
☆5,071 · Updated this week
NVIDIA / gpu-feature-discovery
GPU plugin to the node feature discovery for Kubernetes
☆309 · May 27, 2024 · Updated 2 years ago
volcano-sh / devices
Device plugins for Volcano, e.g. GPU
☆137 · Mar 20, 2025 · Updated last year
NTHU-LSALAB / Gemini
An efficient GPU resource sharing system with fine-grained control for Linux platforms.
☆90 · Mar 25, 2024 · Updated 2 years ago
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆215 · Nov 5, 2020 · Updated 5 years ago
joyme123 / cola-device-plugin
A development example of a Kubernetes device plugin
☆36 · Mar 24, 2020 · Updated 6 years ago
awslabs / aws-virtual-gpu-device-plugin
The AWS virtual GPU device plugin provides the capability to use smaller virtual GPUs for your machine learning inference workloads
☆202 · Nov 22, 2023 · Updated 2 years ago
Project-HAMi / volcano-vgpu-device-plugin
Device plugin for Volcano vGPU that supports hard resource isolation
☆161 · Jun 9, 2026 · Updated last month
Mr-Linus / Yoda-Scheduler
Yoda is a Kubernetes scheduler based on GPU metrics.
☆137 · Mar 27, 2022 · Updated 4 years ago
kubeflow / trainer
Distributed AI Model Training and LLM Fine-Tuning on Kubernetes
☆2,140 · Updated this week
Bruce-Lee-LY / cuda_hook
Hooks CUDA-related dynamic libraries using automated code generation tools.
☆173 · Dec 12, 2023 · Updated 2 years ago
kubernetes-sigs / scheduler-plugins
Repository for out-of-tree scheduler plugins based on scheduler framework.
☆1,302 · Updated this week
gpucloud / k8s-device-plugin
NVIDIA device plugin for Kubernetes
☆15 · Sep 9, 2019 · Updated 6 years ago
sakjain92 / Fractional-GPUs
Splits a single NVIDIA GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions
☆164 · Apr 21, 2019 · Updated 7 years ago
volcano-sh / resource-exporter
Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.
☆19 · Jul 8, 2026 · Updated last week
kleveross / ormb
Docker for Your ML/DL Models Based on OCI Artifacts
☆473 · Jan 26, 2024 · Updated 2 years ago
NVIDIA / dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
☆1,802 · May 12, 2026 · Updated 2 months ago
coldfunction / qCUDA
qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization
☆136 · Feb 9, 2022 · Updated 4 years ago
koordinator-sh / koordinator
A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …
☆1,720 · Updated this week
NVIDIA / kubevirt-gpu-device-plugin
NVIDIA k8s device plugin for KubeVirt
☆286 · Updated this week