NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
☆1,786May 12, 2026Updated last month
Alternatives and similar repositories for dcgm-exporter
Users that are interested in dcgm-exporter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆750Jun 11, 2026Updated 3 weeks ago
- Nvidia GPU exporter for prometheus using nvidia-smi binary☆1,504Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,752Jun 27, 2026Updated last week
- Golang bindings for Nvidia Datacenter GPU Manager (DCGM)☆154Jun 22, 2026Updated last week
- NVIDIA device plugin for Kubernetes☆3,797Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Go Bindings for the NVIDIA Management Library (NVML)☆445Jun 26, 2026Updated last week
- Heterogeneous GPU Sharing on Kubernetes☆3,623Jun 27, 2026Updated last week
- GPU plugin to the node feature discovery for Kubernetes☆309May 27, 2024Updated 2 years ago
- Tools for monitoring NVIDIA GPUs on Linux☆1,074Nov 2, 2021Updated 4 years ago
- ☆372Jun 22, 2026Updated last week
- Exporter for machine metrics☆13,552Jun 26, 2026Updated last week
- A Cloud Native Batch System (Project under CNCF)☆5,714Jun 27, 2026Updated last week
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆54Jun 24, 2026Updated last week
- A toolkit to run Ray applications on Kubernetes☆2,562Jun 26, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆748Jun 26, 2026Updated last week
- Node feature discovery for Kubernetes☆1,052Jun 26, 2026Updated last week
- Build and run containers leveraging NVIDIA GPUs☆4,440Jun 27, 2026Updated last week
- DRA Driver for NVIDIA GPUs☆664Updated this week
- MIG Partition Editor for NVIDIA GPUs☆255Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,533Dec 29, 2023Updated 2 years ago
- This is a place for various problem detectors running on the Kubernetes nodes.☆3,424Updated this week
- Add-on agent to generate and expose cluster-level metrics.☆6,141Jun 23, 2026Updated last week
- NVIDIA k8s device plugin for Kubevirt☆286Jun 24, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,350Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,631Updated this week
- NVIDIA Network Operator☆353Jun 25, 2026Updated last week
- ☆900Apr 2, 2024Updated 2 years ago
- A tool for bandwidth measurements on NVIDIA GPUs.☆723Apr 8, 2026Updated 2 months ago
- NCCL Tests☆1,567Jun 25, 2026Updated last week
- NVIDIA container runtime library☆1,113Updated this week
- Kubernetes-native Job Queueing☆2,647Updated this week
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆313Updated this week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Prometheus exporter for a Infiniband Fabric☆70Dec 12, 2023Updated 2 years ago
- Kubernetes Virtualization API and runtime in order to define and manage virtual machines.☆6,926Updated this week
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration☆5,506Jun 25, 2026Updated last week
- Scalable and efficient source of container resource metrics for Kubernetes built-in autoscaling pipelines.☆6,655Jun 24, 2026Updated last week
- Automated management of large-scale applications on Kubernetes (incubating project under CNCF)☆5,276Jun 21, 2026Updated last week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,297Jun 24, 2026Updated last week
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,127Jun 26, 2026Updated last week