NVIDIA / go-dcgm
Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
☆122Updated last week
Alternatives and similar repositories for go-dcgm
Users who are interested in go-dcgm are comparing it to the libraries listed below
- Go Bindings for the NVIDIA Management Library (NVML)☆378Updated last month
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆115Updated 2 weeks ago
- Device plugins for Volcano, e.g. GPU☆125Updated 3 months ago
- ☆280Updated last week
- NVIDIA Network Operator☆263Updated last week
- HAMi-core compiles libvgpu.so, which enforces hard limits on GPU usage in containers☆184Updated this week
- RDMA CNI plugin for containerized workloads☆55Updated 3 weeks ago
- GPU plugin to the node feature discovery for Kubernetes☆301Updated last year
- ☆32Updated 4 years ago
- Device plugin for Volcano vGPU that supports hard resource isolation☆95Updated 2 weeks ago
- RDMA device plugin for Kubernetes☆217Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆68Updated 2 months ago
- ☆132Updated 4 years ago
- NVIDIA k8s device plugin for Kubevirt☆256Updated 2 weeks ago
- ☆118Updated 2 years ago
- ☆124Updated last week
- ☆62Updated last week
- ☆253Updated 3 weeks ago
- Holistic job manager on Kubernetes☆117Updated last year
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆17Updated last month
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆141Updated 2 years ago
- Kubernetes RDMA SR-IOV device plugin☆111Updated 4 years ago
- MIG Partition Editor for NVIDIA GPUs☆204Updated last week
- Using CRDs to manage GPU resources in Kubernetes.☆203Updated 2 years ago
- NVIDIA DRA Driver for GPUs☆395Updated this week
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod☆126Updated 3 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆487Updated 2 months ago
- A federation scheduler for multi-cluster☆45Updated 3 weeks ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆87Updated 6 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆77Updated last week