Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
☆154Jun 22, 2026Updated last week
Alternatives and similar repositories for go-dcgm
Users that are interested in go-dcgm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Go Bindings for the NVIDIA Management Library (NVML)☆445Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,786May 12, 2026Updated last month
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆750Jun 11, 2026Updated 2 weeks ago
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆31Jun 24, 2026Updated last week
- A collection of useful Go libraries for use with NVIDIA GPU management tools☆56Jun 21, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆122Jun 21, 2026Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆79Apr 14, 2026Updated 2 months ago
- Tools for monitoring NVIDIA GPUs on Linux☆1,074Nov 2, 2021Updated 4 years ago
- nvloom is a set of tools designed to scalably test MNNVL fabrics.☆49Apr 1, 2026Updated 3 months ago
- InfiniBand fabric monitoring daemon written in Go☆32May 22, 2025Updated last year
- ☆372Jun 22, 2026Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,752Updated this week
- RDMA CNI plugin for containerized workloads☆60Jun 25, 2026Updated last week
- ☆22May 26, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- kubernetes device plugin的开发示例☆36Mar 24, 2020Updated 6 years ago
- NVIDIA Network Operator☆353Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆27Dec 6, 2024Updated last year
- A Kubernetes Operator to manage Node OS customizations.☆57Updated this week
- ☆900Apr 2, 2024Updated 2 years ago
- Linux Cross-Memory Attach☆102Feb 18, 2026Updated 4 months ago
- MIG Partition Editor for NVIDIA GPUs☆255Updated this week
- ☆543Jun 7, 2024Updated 2 years ago
- A service-aware RoCE network monitoring system based on end- to-end probing.☆29Mar 1, 2026Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NVIDIA device plugin for Kubernetes☆3,797Updated this week
- Linux based user-space RSHIM driver for the Mellanox BlueField SoC☆35Jun 24, 2026Updated last week
- GPU Sharing Device Plugin for Kubernetes Cluster☆495Jan 10, 2023Updated 3 years ago
- ☆303Updated this week
- vcjob Orchestruating Engine☆31Aug 18, 2022Updated 3 years ago
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,297Jun 24, 2026Updated last week
- DRA Driver for NVIDIA GPUs☆662Jun 25, 2026Updated last week
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆54Jun 24, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Using CRDs to manage GPU resources in Kubernetes.☆214Nov 21, 2022Updated 3 years ago
- ☆78Jun 24, 2026Updated last week
- CUDA checkpoint and restore utility☆467Sep 15, 2025Updated 9 months ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆532Mar 4, 2024Updated 2 years ago
- ☆99Jun 4, 2026Updated 3 weeks ago
- RDMA library for mapping associate netdevice and character devices☆81Jun 22, 2026Updated last week