last9 / awesome-prometheus-toolkitLinks
Alert rules toolkit for Prometheus. Connect Prometheus, discover alert rules, apply!
☆35Updated 6 months ago
Alternatives and similar repositories for awesome-prometheus-toolkit
Users that are interested in awesome-prometheus-toolkit are comparing it to the libraries listed below
Sorting:
- A tool to analyze your TSDB Cardinality stats, to stay one step ahead in knowing the system better☆13Updated last year
- SLOs, Error windows and alerts are complicated. Here an attempt to make it easy☆133Updated 8 months ago
- A glossary of all terms related to Observability, starting from A to Z!☆35Updated last year
- Do more with your metrics☆30Updated 2 years ago
- APM for NodeJS using Prometheus☆66Updated 3 months ago
- Create custom DevOps AI agents that understand and manage your infrastructure.☆83Updated 8 months ago
- Sample applications of supported integrations by Last9 Products☆13Updated last year
- k9s like CLI for AWS and GCP☆575Updated last year
- Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More☆1,508Updated this week
- A curated list of AI-powered DevOps & SRE (Site Reliability Engineering) agents, tools, and resources for automating and enhancing reliab…☆37Updated last month
- Curated list of resources on SLOs☆277Updated 5 months ago
- Grafana dashboard for OpenTelemetry services☆25Updated 2 months ago
- Prometheus Exporter for Cloud Provider agnostic cost metrics☆107Updated this week
- Terraform modules for creating Nomad servers and clients nodes on AWS.☆160Updated last week
- Simplified Kubernetes Clusters Lifecycle Management (Core)☆266Updated 3 weeks ago
- Open Source Incident Management tool for the cloud native ecosystem☆56Updated 2 months ago
- Technical Advisory Group for Observability 🔭⚙️☆723Updated 8 months ago
- Simple but still extremely powerful K9S alternative. An interactive `explain` command. Security scanning based on `trivy`. Supports multi…☆122Updated 6 months ago
- An Opinionated Roadmap to Become an SRE (Concepts > Tools)☆501Updated last year
- DIAL(Did I Alert Lambda?) is a centralised security misconfiguration detection framework which completely runs on AWS Managed services li…☆94Updated 3 years ago
- Making SLOs with Prometheus manageable, accessible, and easy to use for everyone!☆1,436Updated this week
- Telegram channels & groups about DevOps, SRE, and Platform Engineering.☆265Updated last week
- View k8s in graphical fashion☆208Updated 7 months ago
- A tool for generating files and folders ("boilerplate") from a set of templates☆297Updated this week
- Web-based tool to facilitate OpenTelemetry collector configuration editing and verification☆460Updated this week
- The Open Source Incident Management Framework☆154Updated last week
- A vocabulary collection for SREs☆231Updated 3 years ago
- Map Kubernetes traffic: in-cluster, to the Internet, and to AWS IAM and export as text, intents, or an image☆663Updated 5 months ago
- A compilation of resources for SRE interview preparation☆21Updated 2 years ago
- Kubernetes Native Health Check Platform☆286Updated this week