NVIDIA GPU exporter for Prometheus using the nvidia-smi binary
☆1,453Apr 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for nvidia_gpu_exporter
Users that are interested in nvidia_gpu_exporter are comparing it to the libraries listed below.
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,684Apr 7, 2026Updated last week
- nvidia-smi Prometheus exporter that respects GPU UUIDs☆37Apr 12, 2023Updated 3 years ago
- NVIDIA device plugin for Kubernetes☆3,720Apr 12, 2026Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,635Apr 10, 2026Updated last week
- Prometheus exporter for an InfiniBand fabric☆70Dec 12, 2023Updated 2 years ago
- NVIDIA GPU Prometheus Exporter☆251Jul 15, 2021Updated 4 years ago
- Exporter for machine metrics☆13,308Apr 11, 2026Updated last week
- Heterogeneous GPU Sharing on Kubernetes☆3,257Apr 10, 2026Updated last week
- Prometheus Exporter for NVIDIA GPUs using NVML☆79Jun 27, 2020Updated 5 years ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆705Mar 30, 2026Updated 2 weeks ago
- Prometheus exporter for Windows machines☆3,514Apr 10, 2026Updated last week
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.☆6,789Apr 1, 2026Updated 2 weeks ago
- Prometheus exporter that mines /proc to report on selected processes☆2,102Apr 21, 2025Updated 11 months ago
- Go Bindings for the NVIDIA Management Library (NVML)☆431Apr 6, 2026Updated last week
- OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app…☆588May 21, 2024Updated last year
- SNMP Exporter for Prometheus☆2,100Apr 10, 2026Updated last week
- Blackbox prober exporter☆5,642Updated this week
- Prometheus exporter for performance metrics from Slurm.☆279Jun 20, 2024Updated last year
- GPU Sharing Scheduler for Kubernetes Cluster☆1,531Dec 29, 2023Updated 2 years ago
- Use Prometheus to monitor Kubernetes and applications running on Kubernetes☆7,626Updated this week
- ☆74Oct 25, 2025Updated 5 months ago
- Tools for monitoring NVIDIA GPUs on Linux☆1,070Nov 2, 2021Updated 4 years ago
- Build and run containers leveraging NVIDIA GPUs☆4,252Updated this week
- Multi-GPU CUDA stress test☆2,154Nov 4, 2025Updated 5 months ago
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm☆10,442Feb 8, 2026Updated 2 months ago
- Prometheus Exporter for Valkey & Redis Metrics. Supports Valkey 9.x, 8.x, 7.x and various Redis versions☆3,609Updated this week
- VictoriaMetrics: fast, cost-effective monitoring solution and time series database☆16,741Updated this week
- Nightingale is to monitoring and alerting what Grafana is to visualization.☆12,956Updated this week
- ☆893Apr 2, 2024Updated 2 years ago
- A Cloud Native Batch System (Project under CNCF)☆5,440Apr 10, 2026Updated last week
- nvidia-smi exporter for Prometheus☆73Jun 7, 2021Updated 4 years ago
- ☆349Apr 10, 2026Updated last week
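- Prometheus Alert is an open-source message-forwarding system for an operations alert center. It supports alert messages from the mainstream monitoring systems Prometheus and Zabbix, the logging system Graylog, and the data visualization system Grafana, and can deliver them via DingTalk, WeChat, Huawei Cloud SMS, Tencent Cloud SMS, Tencent Cloud voice calls, Alibaba Cloud SMS, Alibaba Cloud voice calls, and others☆3,271Feb 19, 2026Updated 2 months ago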
- Add-on agent to generate and expose cluster-level metrics.☆6,108Apr 9, 2026Updated last week
- An open source trusted cloud native registry project that stores, signs, and scans content.☆28,274Updated this week
- Remote IPMI exporter for Prometheus☆588Updated this week
- 🚨 Collection of Prometheus alerting rules☆7,833Apr 10, 2026Updated last week
- Simple Redfish (iDRAC, iLO, XClarity) exporter for Prometheus☆268Apr 3, 2026Updated 2 weeks ago
- Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes☆9,896Updated this week