Run Book / Operations Manual template for modern software systems
☆721Aug 21, 2019Updated 6 years ago
Alternatives and similar repositories for run-book-template
Users that are interested in run-book-template are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Questions to assess the operability of software systems☆17Apr 4, 2019Updated 6 years ago
- Questions to assess the testability of software systems☆20Oct 24, 2019Updated 6 years ago
- A set of Grafana dashboards and Prometheus alerts for Kubernetes.☆2,405Updated this week
- A collection templates ported from the SRE Workbook☆42Aug 24, 2018Updated 7 years ago
- The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different te…☆210Aug 22, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A curated list of Site Reliability and Production Engineering resources.☆13,083Aug 28, 2025Updated 7 months ago
- ☆17Sep 27, 2022Updated 3 years ago
- Manage application's SLI and SLO's easily with the application lifecycle inside a Kubernetes cluster☆277Jun 11, 2021Updated 4 years ago
- A simple template for a wiki page for a TVP (thinnest viable platform) - as explained in the Team Topologies book☆66Mar 5, 2021Updated 5 years ago
- Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.☆9,421Updated this week
- Write tests against structured configuration data using the Open Policy Agent Rego query language☆3,145Updated this week
- Kubediff: a tool for Kubernetes to show differences between running state and version controlled configuration.☆1,179Oct 24, 2023Updated 2 years ago
- Validation of best practices in your Kubernetes clusters☆3,356Mar 23, 2026Updated last week
- A powerful testing tool for Kubernetes clusters.☆1,977Nov 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A collection of step by step guides for fixing common tech problems.☆128Feb 13, 2023Updated 3 years ago
- PagerDuty's Incident Response Documentation.☆1,037Jan 7, 2025Updated last year
- Tips and tricks for getting through on-call☆408Jun 13, 2020Updated 5 years ago
- Backup and migrate Kubernetes applications and their persistent volumes☆9,925Updated this week
- Validate your Kubernetes configuration files, supports multiple Kubernetes versions☆3,225Jan 29, 2026Updated 2 months ago
- Terratest is a Go library that makes it easier to write automated tests for your infrastructure code.☆7,887Updated this week
- 🦥 Easy and simple Prometheus SLO (service level objectives) generator☆2,437Updated this week
- Jsonnet library for generating Grafana dashboard files.☆1,079Jun 26, 2023Updated 2 years ago
- post mortem tracker☆1,023Oct 1, 2019Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Generate documentation from Terraform modules in various output formats☆4,721Dec 18, 2025Updated 3 months ago
- A collection of postmortem templates☆1,420Jul 12, 2023Updated 2 years ago
- 👀 A Kubernetes cluster resource sanitizer☆6,253Dec 8, 2025Updated 3 months ago
- Website for the DevOps Team Topologies at devopstopologies.com☆97Mar 21, 2023Updated 3 years ago
- A logging library designed with operations in mind.☆12Dec 6, 2017Updated 8 years ago
- Chaos Engineering Toolkit & Orchestration for Developers☆2,003Jul 20, 2024Updated last year
- A list of common Disaster Recovery (DR) scenarios for software companies☆34Sep 13, 2021Updated 4 years ago
- jsonnet library to patch objects loaded from yaml☆17Oct 24, 2023Updated 2 years ago
- Guidance on how to make your environment easier to onboard for Web Ops Engineers, SRE's and DevOps Practitioners☆292Jul 18, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resour…☆5,953Updated this week
- A wrapper to send shell command results to sensu☆21Oct 3, 2022Updated 3 years ago
- Open specification for defining and expressing service level objectives (SLO)☆1,480Nov 25, 2025Updated 4 months ago
- Quick and Easy server testing/validation☆5,874May 1, 2025Updated 10 months ago
- Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes☆9,868Updated this week
- Monzo's real-time incident response and reporting tool ⚡️☆1,554Mar 20, 2024Updated 2 years ago
- Vulnerability Static Analysis for Containers☆10,951Updated this week