4paradigm / k8s-vgpu-scheduler
The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access a larger memory space than the device's physical capacity, and it is designed to make extended device memory easy to use for AI workloads.
☆523 Updated 6 months ago
Related projects
Alternatives and complementary repositories for k8s-vgpu-scheduler
- Heterogeneous AI Computing Virtualization Middleware ☆953 Updated this week
- ☆504 Updated 5 months ago
- ☆832 Updated 7 months ago
- GPU Sharing Device Plugin for Kubernetes Cluster ☆471 Updated last year
- Using CRDs to manage GPU resources in Kubernetes. ☆192 Updated 2 years ago
- One-click Kubeflow installation files for users in mainland China ☆342 Updated 2 years ago
- HAMi-core compiles libvgpu.so, which ensures a hard limit on GPU resources in the container ☆105 Updated last month
- ☆269 Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆510 Updated 8 months ago
- ☆129 Updated 3 years ago
- GPU Sharing Scheduler for Kubernetes Cluster ☆1,415 Updated 10 months ago
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod ☆120 Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training ☆84 Updated 3 months ago
- Device plugins for Volcano, e.g. GPU ☆104 Updated 2 months ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resource scheduling. ☆135 Updated 2 years ago
- ☆198 Updated 3 weeks ago
- Kubeflow helm chart ☆139 Updated last year
- ☆48 Updated last month
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute ☆311 Updated last year
- A CLI for Kubeflow. ☆740 Updated this week
- Share GPU between Pods in Kubernetes ☆201 Updated last year
- NVIDIA k8s device plugin for Kubevirt ☆232 Updated last month
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. ☆197 Updated 2 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆440 Updated last month
- Large language model fine-tuning capabilities based on cloud native and distributed computing. ☆91 Updated 8 months ago
- ☆114 Updated last week
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development… ☆94 Updated 3 years ago
- ☆33 Updated 4 years ago
- GPU plugin to the node feature discovery for Kubernetes ☆293 Updated 5 months ago
- Kubernetes Scheduler for Deep Learning ☆255 Updated 2 years ago