4paradigm / k8s-vgpu-scheduler
The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access a larger memory space than the device's physical capacity, and it is designed to make extended device memory easy to use for AI workloads.
☆560 Updated 10 months ago
Alternatives and similar repositories for k8s-vgpu-scheduler:
Users who are interested in k8s-vgpu-scheduler are comparing it to the repositories listed below:
- ☆525 Updated 10 months ago
- ☆874 Updated last year
- Heterogeneous AI Computing Virtualization Middleware ☆1,492 Updated this week
- GPU Sharing Device Plugin for Kubernetes Cluster ☆480 Updated 2 years ago
- Using CRDs to manage GPU resources in Kubernetes. ☆197 Updated 2 years ago
- HAMi-core compiles libvgpu.so, which enforces a hard limit on GPU usage in a container ☆152 Updated this week
- ☆132 Updated 3 years ago
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod ☆124 Updated 3 years ago
- ☆274 Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆518 Updated last year
- ☆51 Updated 3 weeks ago
- Device plugins for Volcano, e.g. GPU ☆118 Updated 3 weeks ago
- Large language model fine-tuning capabilities based on cloud native and distributed computing. ☆92 Updated last year
- ☆244 Updated last week
- GPU Sharing Scheduler for Kubernetes Cluster ☆1,461 Updated last year
- One-click Kubeflow installation files for users in China ☆346 Updated 2 years ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling. ☆140 Updated 2 years ago
- Device plugin for Volcano vGPU that supports hard resource isolation ☆71 Updated 3 weeks ago
- NVIDIA k8s device plugin for Kubevirt ☆252 Updated last month
- Kubeflow helm chart ☆144 Updated last year
- Share GPU between Pods in Kubernetes ☆210 Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training ☆85 Updated 3 months ago
- A CLI for Kubeflow. ☆765 Updated last week
- GPU plugin to the node feature discovery for Kubernetes ☆300 Updated 10 months ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆473 Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆341 Updated this week
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development… ☆97 Updated 3 years ago
- ☆120 Updated last month
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆397 Updated this week
- ☆35 Updated 4 years ago