NVIDIA / mig-parted
MIG Partition Editor for NVIDIA GPUs
☆190Updated this week
Alternatives and similar repositories for mig-parted:
Users that are interested in mig-parted are comparing it to the libraries listed below
- NVIDIA NCCL Tests for Distributed Training☆84Updated last week
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆480Updated last month
- GPU plugin to the node feature discovery for Kubernetes☆298Updated 9 months ago
- ☆237Updated this week
- NVIDIA Network Operator☆243Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆330Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆469Updated last week
- ☆248Updated this week
- Device plugins for Volcano, e.g. GPU☆116Updated 6 months ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆144Updated 2 weeks ago
- ☆60Updated last week
- Share GPU between Pods in Kubernetes☆210Updated 2 years ago
- Controller for ModelMesh☆225Updated 3 weeks ago
- A Slurm cluster for Kubernetes☆55Updated 7 months ago
- Run cloud native workloads on NVIDIA GPUs☆164Updated this week
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆79Updated 11 months ago
- Holistic job manager on Kubernetes☆112Updated last year
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times☆88Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆85Updated 2 months ago
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆312Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆88Updated this week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆140Updated 2 years ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆194Updated this week
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆158Updated 5 years ago
- Automatic tuning for ML model deployment on Kubernetes☆81Updated 4 months ago
- NVIDIA k8s device plugin for Kubevirt☆248Updated last week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆167Updated this week
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆124Updated 3 years ago
- RDMA and SHARP plugins for nccl library☆183Updated last month
- CUDA checkpoint and restore utility☆306Updated last month