Run Book / Operations Manual template for modern software systems
☆719Aug 21, 2019Updated 6 years ago
Alternatives and similar repositories for run-book-template
Users that are interested in run-book-template are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Questions to assess the operability of software systems☆17Apr 4, 2019Updated 7 years ago
- Questions to assess the testability of software systems☆20Oct 24, 2019Updated 6 years ago
- A set of Grafana dashboards and Prometheus alerts for Kubernetes.☆2,417May 19, 2026Updated last week
- A collection templates ported from the SRE Workbook☆44Aug 24, 2018Updated 7 years ago
- The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different te…☆210Aug 22, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A curated list of Site Reliability and Production Engineering resources.☆13,230Aug 28, 2025Updated 9 months ago
- ☆17Sep 27, 2022Updated 3 years ago
- Manage application's SLI and SLO's easily with the application lifecycle inside a Kubernetes cluster☆277Jun 11, 2021Updated 4 years ago
- A simple template for a wiki page for a TVP (thinnest viable platform) - as explained in the Team Topologies book☆65Mar 5, 2021Updated 5 years ago
- Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.☆9,579May 21, 2026Updated last week
- Write tests against structured configuration data using the Open Policy Agent Rego query language☆3,172Updated this week
- Kubediff: a tool for Kubernetes to show differences between running state and version controlled configuration.☆1,181Oct 24, 2023Updated 2 years ago
- Validation of best practices in your Kubernetes clusters☆3,368May 19, 2026Updated last week
- A powerful testing tool for Kubernetes clusters.☆1,977Nov 10, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A collection of step by step guides for fixing common tech problems.☆128Feb 13, 2023Updated 3 years ago
- PagerDuty's Incident Response Documentation.☆1,044Apr 9, 2026Updated last month
- Tips and tricks for getting through on-call☆402Jun 13, 2020Updated 5 years ago
- Backup and migrate Kubernetes applications and their persistent volumes☆10,030May 22, 2026Updated last week
- Validate your Kubernetes configuration files, supports multiple Kubernetes versions☆3,227Jan 29, 2026Updated 4 months ago
- Terratest is a Go library that makes it easier to write automated tests for your infrastructure code.☆7,917Updated this week
- 🦥 Easy and simple Prometheus SLO (service level objectives) generator☆2,487May 6, 2026Updated 3 weeks ago
- Jsonnet library for generating Grafana dashboard files.☆1,076Jun 26, 2023Updated 2 years ago
- post mortem tracker☆1,018Oct 1, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Generate documentation from Terraform modules in various output formats☆4,782May 10, 2026Updated 2 weeks ago
- A collection of postmortem templates☆1,436Jul 12, 2023Updated 2 years ago
- 👀 A Kubernetes cluster resource sanitizer☆6,283Dec 8, 2025Updated 5 months ago
- Website for the DevOps Team Topologies at devopstopologies.com☆98Mar 21, 2023Updated 3 years ago
- A logging library designed with operations in mind.☆12Dec 6, 2017Updated 8 years ago
- Chaos Engineering Toolkit & Orchestration for Developers☆2,012Jul 20, 2024Updated last year
- A list of common Disaster Recovery (DR) scenarios for software companies☆35Sep 13, 2021Updated 4 years ago
- jsonnet library to patch objects loaded from yaml☆17Oct 24, 2023Updated 2 years ago
- Guidance on how to make your environment easier to onboard for Web Ops Engineers, SRE's and DevOps Practitioners☆292Jul 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resour…☆5,988May 20, 2026Updated last week
- A wrapper to send shell command results to sensu☆21Oct 3, 2022Updated 3 years ago
- Open specification for defining and expressing service level objectives (SLO)☆1,491Nov 25, 2025Updated 6 months ago
- Quick and Easy server testing/validation☆5,894May 1, 2025Updated last year
- Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes☆9,922May 22, 2026Updated last week
- Monzo's real-time incident response and reporting tool ⚡️☆1,557Mar 20, 2024Updated 2 years ago
- Vulnerability Static Analysis for Containers☆10,987Updated this week