grgalex / nvshareLinks
Practical GPU Sharing Without Memory Size Constraints
☆304Updated 10 months ago
Alternatives and similar repositories for nvshare
Users that are interested in nvshare are comparing it to the libraries listed below
Sorting:
- NVIDIA DRA Driver for GPUs☆557Updated this week
- MIG Partition Editor for NVIDIA GPUs☆240Updated this week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆145Updated 3 years ago
- GPU plugin to the node feature discovery for Kubernetes☆307Updated last year
- Share GPU between Pods in Kubernetes☆216Updated 3 years ago
- ☆212Updated this week
- ☆334Updated last week
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆275Updated last week
- NVIDIA k8s device plugin for Kubevirt☆278Updated last week
- cricket is a virtualization solution for GPUs☆234Updated 5 months ago
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆88Updated last year
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆513Updated last week
- Device plugins for Volcano, e.g. GPU☆132Updated 10 months ago
- NVIDIA Network Operator☆320Updated this week
- CUDA checkpoint and restore utility☆410Updated 4 months ago
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆475Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆662Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆308Updated this week
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆371Updated 3 weeks ago
- A Slurm cluster for Kubernetes☆68Updated last year
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆127Updated 3 years ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆287Updated 2 weeks ago
- ☆282Updated 2 weeks ago
- ☆540Updated last year
- ☆68Updated last year
- NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated compu…☆177Updated this week
- Hooked CUDA-related dynamic libraries by using automated code generation tools.☆172Updated 2 years ago
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆165Updated 6 years ago
- NVIDIA NCCL Tests for Distributed Training☆134Updated 2 weeks ago
- Holistic job manager on Kubernetes☆116Updated last year