res-eng / incident-writeupsLinks
A curated list of well-written publicly available incident writeups
☆13Updated 6 years ago
Alternatives and similar repositories for incident-writeups
Users that are interested in incident-writeups are comparing it to the libraries listed below
Sorting:
- ☆70Updated 5 years ago
- AWS EBS-EC2 attach utility. UNMAINTAINED, SEE FORK ->☆29Updated 2 years ago
- A GitHub App that uses kubeval to validate all of that Kubernetes YAML in your repo☆94Updated 3 years ago
- A tool for flashing OS images onto stateful servers☆47Updated 4 years ago
- Periodically run a command and exports its return code as a prometheus metric.☆117Updated 2 years ago
- A @HashiCorp Terraform provider for managing Google Calendar events.☆137Updated 4 years ago
- Example code for the blog post on auto-joining a Consul cluster on AWS EC2.☆64Updated 2 years ago
- Expose AWS service usage and limits to Prometheus☆47Updated last year
- The Consul-Native Service Mesh☆64Updated 7 years ago
- Dogscaler scales up AWS autoscale groups based on the results of a datadog query.☆16Updated 3 weeks ago
- Automatically rebalance your kafka topics, partitions, replicas across your cluster☆49Updated 7 years ago
- Tool to post graphite annotations to grafana☆16Updated 4 years ago
- Helps to prevent Consul from firing prematurely.☆72Updated 9 years ago
- Kubernetes Resource Explorer☆135Updated 6 years ago
- Sidecar container for requesting dynamic Vault database secrets☆84Updated 7 months ago
- Better Living Through Statistics: Monitoring Doesn't Have To Suck☆159Updated this week
- Nginx based Kubernetes ingress controller for AWS☆58Updated last year
- experimental carbon load testing tool☆84Updated 2 years ago
- Firehose all nomad job, allocation, nodes and evaluations changes to rabbitmq, kinesis or stdout☆115Updated 4 years ago
- [alpha] Emit Datadog monitors based on Kubernetes state.☆86Updated this week
- What I wish I knew before going oncall☆12Updated 5 years ago
- Sherpa is a highly available, fast, and flexible horizontal job scaling for HashiCorp Nomad. It is capable of running in a number of diff…☆162Updated 5 years ago
- A collection templates ported from the SRE Workbook☆41Updated 6 years ago
- Control Consul traffic splitting from the comfort of you mac TouchBar☆16Updated 5 years ago
- The Agile Operations methodology☆146Updated last year
- A tool to create Spinnaker Pipeline JSON from a simple Yaml file☆84Updated last year
- Generate Prometheus alerting & recording rules and Grafana dashboards for your SLOs.☆120Updated 3 years ago
- A daemon for responding to AWS AutoScaling Lifecycle Hooks☆147Updated last month
- A simple kubernetes deployment manager☆169Updated 3 years ago
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering☆226Updated 6 years ago