4paradigm / k8s-vgpu-scheduler
The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access a larger memory space than the device's physical capacity, and it is designed to make extended device memory easy to use for AI workloads.
☆554 Updated 10 months ago
Alternatives and similar repositories for k8s-vgpu-scheduler:
Users who are interested in k8s-vgpu-scheduler are comparing it to the repositories listed below.
- ☆863 Updated 11 months ago
- ☆521 Updated 9 months ago
- Heterogeneous AI Computing Virtualization Middleware ☆1,395 Updated this week
- GPU Sharing Device Plugin for Kubernetes Cluster ☆478 Updated 2 years ago
- Using CRDs to manage GPU resources in Kubernetes. ☆197 Updated 2 years ago
- HAMi-core compiles libvgpu.so, which enforces a hard limit on GPU usage in the container ☆144 Updated 3 weeks ago
- ☆131 Updated 3 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆517 Updated last year
- ☆51 Updated 2 months ago
- ☆274 Updated last year
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod ☆124 Updated 3 years ago
- One-click Kubeflow installation files for users in China ☆344 Updated 2 years ago
- GPU Sharing Scheduler for Kubernetes Cluster ☆1,456 Updated last year
- Kubeflow helm chart ☆143 Updated last year
- Device plugins for Volcano, e.g. GPU ☆117 Updated this week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling. ☆140 Updated 2 years ago
- ☆237 Updated this week
- Kubernetes Operator for AI and Bigdata Elastic Training ☆85 Updated 2 months ago
- NVIDIA k8s device plugin for Kubevirt ☆248 Updated last week
- Share GPU between Pods in Kubernetes ☆210 Updated 2 years ago
- GPU plugin to the node feature discovery for Kubernetes ☆298 Updated 9 months ago
- Large language model fine-tuning capabilities based on cloud native and distributed computing. ☆92 Updated last year
- A CLI for Kubeflow. ☆760 Updated last week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆469 Updated last week
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development… ☆97 Updated 3 years ago
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆330 Updated last week
- Device plugin for Volcano vGPU that supports hard resource isolation ☆67 Updated this week
- cloud-native local storage management system for stateful workload, low-latency with simplicity ☆477 Updated 3 months ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. ☆195 Updated 3 years ago
- Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs ☆348 Updated 11 months ago