nickstenning / learningfromincidents
Links and resources from my talk about how to learn more from incidents!
☆21Updated 4 years ago
Alternatives and similar repositories for learningfromincidents
Users who are interested in learningfromincidents are comparing it to the repositories listed below
- KubeSanity, a sanity-checking framework for Kubernetes☆84Updated 7 years ago
- Discover microservice. Discovers violations and provides metrics and also API endpoints for K8Guard.☆15Updated 10 months ago
- A collection of templates ported from the SRE Workbook☆40Updated 6 years ago
- Rules loading sidecar for Prometheus deployed in Kubernetes☆44Updated 4 years ago
- Continuously deploy feature branches to your Kubernetes cluster☆41Updated 8 years ago
- A sample of major outages and incidents☆18Updated 5 years ago
- Rotator is a tool for rotating credentials on a regular schedule.☆16Updated 2 years ago
- A curated list of awesome Jsonnet projects and mixins☆28Updated 6 years ago
- experimental carbon load testing tool☆84Updated 2 years ago
- Generate SLOs for Prometheus via HTTP with Jsonnet☆39Updated 2 years ago
- Prometheus exporter for aws billing information☆24Updated 5 years ago
- Because Clair needs a friend☆31Updated 6 years ago
- A 'realtime' kubernetes resource linter☆42Updated 10 months ago
- print summary of a kubernetes manifest☆32Updated 2 years ago
- Tool to migrate Prometheus 1.x data directories to the 2.0 format.☆14Updated 7 years ago
- cert-operator creates and manages certificates for Kubernetes clusters running on Giant Swarm☆35Updated 6 months ago
- Nginx based Kubernetes ingress controller for AWS☆58Updated last year
- A tool for flashing OS images onto stateful servers☆46Updated 4 years ago
- easily save grafana annotations from slack mentions and the cli☆68Updated last month
- Determine your cloud provider with a simple HTTP call☆51Updated 3 years ago
- DEPRECATED☆12Updated 7 years ago
- Concourse generic resource that lets you quickly implement any resource☆39Updated 2 years ago
- Expose AWS service usage and limits to Prometheus☆47Updated last year
- AWS Kubernetes Node Terminator☆21Updated last year
- Generate Prometheus alerting & recording rules and Grafana dashboards for your SLOs.☆120Updated 2 years ago
- a security controller for Kubernetes☆14Updated 6 years ago
- A GitHub App that uses kubeval to validate all of that Kubernetes YAML in your repo☆94Updated 3 years ago
- A curated list of well-written publicly available incident writeups☆13Updated 5 years ago
- Kubernetes cluster provisioning for AWS using CoreOS and Terraform☆28Updated 6 years ago
- A top for your kubernetes cluster☆26Updated 6 years ago