alicegoldfuss / oncall-handbookLinks
Tips and tricks for getting through on-call
☆401Updated 5 years ago
Alternatives and similar repositories for oncall-handbook
Users that are interested in oncall-handbook are comparing it to the libraries listed below
Sorting:
- Leading Groups at Etsy to Learn From Accidents☆255Updated 11 months ago
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering☆226Updated 6 years ago
- A tool for performing actions on GitHub repos or a single repo.☆359Updated 2 years ago
- On call alert classification and reporting☆761Updated 7 years ago
- For auditing what collaborators, hooks, and deploy keys you have added on all your GitHub repositories.☆335Updated 5 years ago
- Reviews of published Postmortem Reports☆57Updated 8 years ago
- A Chaos Engineering Bootcamp☆172Updated 7 years ago
- HumanOps deliberately highlights the importance of the teams running systems, not just the systems themselves.☆270Updated 3 years ago
- A GitHub Bot to automatically delete your fork's branches after a pull request has been merged.☆288Updated 4 years ago
- PagerDuty on-call widget for monitoring dashboard. Datadog and Grafana compatible☆343Updated 2 years ago
- Guidance on how to make your environment easier to onboard for Web Ops Engineers, SRE's and DevOps Practitioners☆285Updated 11 months ago
- A collection of Twilio SRE's Gameday Templates☆139Updated 4 years ago
- ☆70Updated 5 years ago
- A set of Terraform modules for configuring production infrastructure with AWS☆2,100Updated 2 years ago
- Documents and resources for the "Learning from Incidents in Software" slack workspace.☆39Updated 4 years ago
- The curriculum for apprentice-level engineers at FairWinds Ops☆95Updated 4 years ago
- Run Book / Operations Manual template for modern software systems☆715Updated 5 years ago
- What to Read to Learn More About DevOps☆456Updated 2 years ago
- A curated list of amazingly awesome Chef resources☆142Updated 9 years ago
- Catalog of valuable metrics you might want to collect☆346Updated 10 years ago
- Cloud Native Infrastructure BackUp & RecoveRY☆256Updated 6 years ago
- A checklist for ensuring that code reviews are thorough☆45Updated 6 years ago
- A bot for keeping your ssh authorized_keys up to date with user's GitHub keys, **only** use if you enable 2FA & keep your keys updates.☆280Updated 4 years ago
- Disaster Recovery and Configuration Management for Consul and Vault☆187Updated last year
- post mortem tracker☆1,018Updated 5 years ago
- Configs and scripts for bootstrapping an opinionated Kubernetes cluster anywhere.☆395Updated 6 years ago
- Ansible dynamic inventory script for parsing Terraform state files☆452Updated 6 years ago
- Decommissioned: Puppet manifests that used to provision the legacy GOV.UK stack.☆127Updated last year
- A toolkit for Kubernetes cluster provisioning and lifecycle management☆266Updated 5 years ago
- How to guide on running HashiCorp's Vault on Google Kubernetes Engine☆387Updated 4 years ago