res-eng / incident-writeupsLinks
A curated list of well-written publicly available incident writeups
☆13Updated 6 years ago
Alternatives and similar repositories for incident-writeups
Users that are interested in incident-writeups are comparing it to the libraries listed below
Sorting:
- AWS EBS-EC2 attach utility. UNMAINTAINED, SEE FORK ->☆29Updated 2 years ago
- A tool for flashing OS images onto stateful servers☆47Updated 5 years ago
- ☆70Updated 6 years ago
- Expose AWS service usage and limits to Prometheus☆47Updated last week
- A GitHub App that uses kubeval to validate all of that Kubernetes YAML in your repo☆94Updated 4 years ago
- Periodically run a command and exports its return code as a prometheus metric.☆118Updated 2 years ago
- Dogscaler scales up AWS autoscale groups based on the results of a datadog query.☆16Updated 2 weeks ago
- A repo of links to articles, papers, conference talks, and tooling related to load management in software services.☆194Updated 2 years ago
- Images and links to references for Kafka Fault Tree Analysis talks by Andrey Falko☆21Updated 4 years ago
- What I wish I knew before going oncall☆12Updated 6 years ago
- A daemon for responding to AWS AutoScaling Lifecycle Hooks☆147Updated 2 weeks ago
- Encrypted environment variables via AWS KMS☆29Updated 2 years ago
- Look up region and other information for any AWS IP address☆90Updated 2 years ago
- A CLI tool providing you with status & configuration of a Kubernetes cluster fleet☆109Updated last year
- Example code for the blog post on auto-joining a Consul cluster on AWS EC2.☆63Updated 2 weeks ago
- A @HashiCorp Terraform provider for managing Google Calendar events.☆138Updated 5 years ago
- Vault Unseal automation☆131Updated 6 years ago
- The Agile Operations methodology☆145Updated 2 years ago
- Sherpa is a highly available, fast, and flexible horizontal job scaling for HashiCorp Nomad. It is capable of running in a number of diff…☆162Updated 5 years ago
- A collection of Twilio SRE's Gameday Templates☆140Updated 5 years ago
- ## Auto-archived due to inactivity. ## Go program to move data in and out of Consul's KV store.☆128Updated 4 years ago
- Simplistic chaos engineering tool for kubernetes application resilience testing☆37Updated 2 years ago
- Keep your Consul service catalog in sync with your RDS instances☆16Updated last month
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering☆226Updated 7 years ago
- Kubernetes Resource Explorer☆135Updated 7 years ago
- Simple, elastic Kubernetes cluster autoscaler for AWS Auto Scaling Groups☆93Updated 6 years ago
- Helps to prevent Consul from firing prematurely.☆71Updated 9 years ago
- Generate Prometheus alerting & recording rules and Grafana dashboards for your SLOs.☆118Updated 3 years ago
- The Consul-Native Service Mesh☆64Updated 7 years ago
- Firehose all nomad job, allocation, nodes and evaluations changes to rabbitmq, kinesis or stdout☆116Updated 5 years ago