4paradigm / k8s-vgpu-scheduler
OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical capacity. It is designed for ease of use of extended device memory for AI workloads.
☆515Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for k8s-vgpu-scheduler
- Heterogeneous AI Computing Virtualization Middleware☆897Updated this week
- ☆828Updated 7 months ago
- ☆501Updated 5 months ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆470Updated last year
- Using CRDs to manage GPU resources in Kubernetes.☆191Updated last year
- ☆129Updated 3 years ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆99Updated 3 weeks ago
- GPU Sharing Scheduler for Kubernetes Cluster☆1,409Updated 10 months ago
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆120Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆505Updated 8 months ago
- ☆269Updated last year
- kubeflow国内一键安装文件☆342Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆84Updated 3 months ago
- Device plugins for Volcano, e.g. GPU☆103Updated last month
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆135Updated last year
- Kubeflow helm chart☆139Updated last year
- ☆48Updated 3 weeks ago
- A CLI for Kubeflow.☆737Updated this week
- Share GPU between Pods in Kubernetes☆201Updated last year
- Large language model fine-tuning capabilities based on cloud native and distributed computing.☆89Updated 8 months ago
- ☆194Updated last week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆440Updated 3 weeks ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆196Updated 2 years ago
- NVIDIA k8s device plugin for Kubevirt☆230Updated 3 weeks ago
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development…☆94Updated 3 years ago
- A CLI for Kubeflow.☆58Updated 10 months ago
- GPU plugin to the node feature discovery for Kubernetes☆291Updated 5 months ago
- Kubernetes Scheduler for Deep Learning☆254Updated 2 years ago
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆308Updated last year
- Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs☆343Updated 7 months ago