Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
☆148Feb 14, 2026Updated last month
Alternatives and similar repositories for go-dcgm
Users that are interested in go-dcgm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Go Bindings for the NVIDIA Management Library (NVML)☆426Feb 12, 2026Updated last month
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,648Feb 25, 2026Updated 3 weeks ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆685Feb 17, 2026Updated last month
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆29Mar 6, 2026Updated 2 weeks ago
- A collection of useful Go libraries for use with NVIDIA GPU management tools☆50Jan 15, 2026Updated 2 months ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆122Dec 8, 2025Updated 3 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆76Jul 18, 2025Updated 8 months ago
- Tools for monitoring NVIDIA GPUs on Linux☆1,069Nov 2, 2021Updated 4 years ago
- nvloom is a set of tools designed to scalably test MNNVL fabrics.☆43Mar 12, 2026Updated last week
- InfiniBand fabric monitoring daemon written in Go☆32May 22, 2025Updated 10 months ago
- ☆341Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,598Updated this week
- RDMA CNI plugin for containerized workloads☆60Updated this week
- ☆24Mar 3, 2026Updated 2 weeks ago
- kubernetes device plugin的开发示例☆36Mar 24, 2020Updated 5 years ago
- NVIDIA Network Operator☆327Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- A Kubernetes Operator to manage Node OS customizations.☆48Updated this week
- ☆893Apr 2, 2024Updated last year
- Linux Cross-Memory Attach☆97Feb 18, 2026Updated last month
- MIG Partition Editor for NVIDIA GPUs☆244Updated this week
- A service-aware RoCE network monitoring system based on end- to-end probing.☆24Mar 1, 2026Updated 3 weeks ago
- ☆540Jun 7, 2024Updated last year
- NVIDIA device plugin for Kubernetes☆3,706Updated this week
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆50Updated this week
- Linux based user-space RSHIM driver for the Mellanox BlueField SoC☆35Updated this week
- ☆294Mar 9, 2026Updated 2 weeks ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆492Jan 10, 2023Updated 3 years ago
- vcjob Orchestruating Engine☆30Aug 18, 2022Updated 3 years ago
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- NVIDIA DRA Driver for GPUs☆585Updated this week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,284Updated this week
- Using CRDs to manage GPU resources in Kubernetes.☆208Nov 21, 2022Updated 3 years ago
- ☆92Dec 28, 2023Updated 2 years ago
- CUDA checkpoint and restore utility☆429Sep 15, 2025Updated 6 months ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- A tool for bandwidth measurements on NVIDIA GPUs.☆645Apr 15, 2025Updated 11 months ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆531Mar 4, 2024Updated 2 years ago
- RDMA library for mapping associate netdevice and character devices☆79Dec 12, 2024Updated last year