Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
☆152Apr 22, 2026Updated last week
Alternatives and similar repositories for go-dcgm
Users that are interested in go-dcgm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Go Bindings for the NVIDIA Management Library (NVML)☆431Apr 14, 2026Updated 2 weeks ago
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,703Apr 7, 2026Updated 3 weeks ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆715Apr 21, 2026Updated last week
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆29Updated this week
- A collection of useful Go libraries for use with NVIDIA GPU management tools☆51Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆123Apr 21, 2026Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆77Apr 14, 2026Updated 2 weeks ago
- Tools for monitoring NVIDIA GPUs on Linux☆1,071Nov 2, 2021Updated 4 years ago
- nvloom is a set of tools designed to scalably test MNNVL fabrics.☆47Apr 1, 2026Updated last month
- InfiniBand fabric monitoring daemon written in Go☆32May 22, 2025Updated 11 months ago
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,661Apr 24, 2026Updated last week
- ☆354Updated this week
- RDMA CNI plugin for containerized workloads☆60Updated this week
- ☆24Mar 3, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- kubernetes device plugin的开发示例☆36Mar 24, 2020Updated 6 years ago
- NVIDIA Network Operator☆332Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- A Kubernetes Operator to manage Node OS customizations.☆52Updated this week
- ☆897Apr 2, 2024Updated 2 years ago
- Linux Cross-Memory Attach☆98Feb 18, 2026Updated 2 months ago
- MIG Partition Editor for NVIDIA GPUs☆248Updated this week
- ☆542Jun 7, 2024Updated last year
- NVIDIA device plugin for Kubernetes☆3,738Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Linux based user-space RSHIM driver for the Mellanox BlueField SoC☆36Apr 24, 2026Updated last week
- GPU Sharing Device Plugin for Kubernetes Cluster☆493Jan 10, 2023Updated 3 years ago
- ☆297Apr 16, 2026Updated 2 weeks ago
- vcjob Orchestruating Engine☆30Aug 18, 2022Updated 3 years ago
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆52Updated this week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,287Apr 21, 2026Updated last week
- DRA Driver for NVIDIA GPUs☆633Updated this week
- ☆76Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CUDA checkpoint and restore utility☆443Sep 15, 2025Updated 7 months ago
- Using CRDs to manage GPU resources in Kubernetes.☆210Nov 21, 2022Updated 3 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆531Mar 4, 2024Updated 2 years ago
- ☆95Mar 30, 2026Updated last month
- A tool for bandwidth measurements on NVIDIA GPUs.☆689Apr 8, 2026Updated 3 weeks ago
- RDMA library for mapping associate netdevice and character devices☆80Mar 25, 2026Updated last month