grgalex / nvshare
Practical GPU Sharing Without Memory Size Constraints
☆200Updated last month
Related projects: ⓘ
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆134Updated last year
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆227Updated this week
- NVIDIA k8s device plugin for Kubevirt☆222Updated last month
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆118Updated 2 years ago
- ☆187Updated this week
- Device plugins for Volcano, e.g. GPU☆98Updated last week
- MIG Partition Editor for NVIDIA GPUs☆163Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆287Updated 3 months ago
- cricket is a virtualization solution for GPUs☆139Updated 8 months ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆72Updated 2 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆59Updated last month
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆67Updated 5 months ago
- elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.☆54Updated 2 years ago
- ☆490Updated 3 months ago
- Share GPU between Pods in Kubernetes☆196Updated last year
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆258Updated this week
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆152Updated 5 years ago
- Hooked CUDA-related dynamic libraries by using automated code generation tools.☆126Updated 9 months ago
- ☆126Updated 3 years ago
- Using CRDs to manage GPU resources in Kubernetes.☆185Updated last year
- CUDA checkpoint and restore utility☆193Updated 5 months ago
- ☆202Updated 2 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆133Updated this week
- Kubernetes Operator for AI and Bigdata Elastic Training☆84Updated last month
- Controller for ModelMesh☆200Updated 2 months ago
- NVIDIA device plugin for Kubernetes☆42Updated 7 months ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆426Updated 4 months ago
- Mellanox Network Operator☆201Updated last week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆200Updated 9 months ago
- ☆32Updated 3 weeks ago