Run Book / Operations Manual template for modern software systems
☆720Aug 21, 2019Updated 6 years ago
Alternatives and similar repositories for run-book-template
Users that are interested in run-book-template are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Questions to assess the operability of software systems☆17Apr 4, 2019Updated 7 years ago
- Questions to assess the testability of software systems☆20Oct 24, 2019Updated 6 years ago
- A set of Grafana dashboards and Prometheus alerts for Kubernetes.☆2,416Updated this week
- A collection templates ported from the SRE Workbook☆44Aug 24, 2018Updated 7 years ago
- The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different te…☆210Aug 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of Site Reliability and Production Engineering resources.☆13,176Aug 28, 2025Updated 8 months ago
- ☆17Sep 27, 2022Updated 3 years ago
- Manage application's SLI and SLO's easily with the application lifecycle inside a Kubernetes cluster☆277Jun 11, 2021Updated 4 years ago
- A simple template for a wiki page for a TVP (thinnest viable platform) - as explained in the Team Topologies book☆65Mar 5, 2021Updated 5 years ago
- Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.☆9,562Updated this week
- Write tests against structured configuration data using the Open Policy Agent Rego query language☆3,166May 1, 2026Updated last week
- Kubediff: a tool for Kubernetes to show differences between running state and version controlled configuration.☆1,181Oct 24, 2023Updated 2 years ago
- Validation of best practices in your Kubernetes clusters☆3,365Updated this week
- A powerful testing tool for Kubernetes clusters.☆1,978Nov 10, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A collection of step by step guides for fixing common tech problems.☆128Feb 13, 2023Updated 3 years ago
- PagerDuty's Incident Response Documentation.☆1,042Apr 9, 2026Updated last month
- Tips and tricks for getting through on-call☆402Jun 13, 2020Updated 5 years ago
- Backup and migrate Kubernetes applications and their persistent volumes☆10,002Updated this week
- Validate your Kubernetes configuration files, supports multiple Kubernetes versions☆3,227Jan 29, 2026Updated 3 months ago
- Terratest is a Go library that makes it easier to write automated tests for your infrastructure code.☆7,912Updated this week
- 🦥 Easy and simple Prometheus SLO (service level objectives) generator☆2,477Updated this week
- Jsonnet library for generating Grafana dashboard files.☆1,077Jun 26, 2023Updated 2 years ago
- post mortem tracker☆1,020Oct 1, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Generate documentation from Terraform modules in various output formats☆4,769Apr 30, 2026Updated last week
- A collection of postmortem templates☆1,432Jul 12, 2023Updated 2 years ago
- 👀 A Kubernetes cluster resource sanitizer☆6,274Dec 8, 2025Updated 5 months ago
- Website for the DevOps Team Topologies at devopstopologies.com☆98Mar 21, 2023Updated 3 years ago
- A logging library designed with operations in mind.☆12Dec 6, 2017Updated 8 years ago
- Chaos Engineering Toolkit & Orchestration for Developers☆2,006Jul 20, 2024Updated last year
- A list of common Disaster Recovery (DR) scenarios for software companies☆35Sep 13, 2021Updated 4 years ago
- jsonnet library to patch objects loaded from yaml☆17Oct 24, 2023Updated 2 years ago
- Guidance on how to make your environment easier to onboard for Web Ops Engineers, SRE's and DevOps Practitioners☆292Jul 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- S3 Log Streamer (S3 log forwarder) aims at facilitating the extraction of real time s3 log published and stream it to tcp clients, syslog…☆11May 16, 2016Updated 9 years ago
- Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resour…☆5,980Updated this week
- A wrapper to send shell command results to sensu☆21Oct 3, 2022Updated 3 years ago
- Open specification for defining and expressing service level objectives (SLO)☆1,487Nov 25, 2025Updated 5 months ago
- Quick and Easy server testing/validation☆5,889May 1, 2025Updated last year
- Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes☆9,912Updated this week
- Monzo's real-time incident response and reporting tool ⚡️☆1,556Mar 20, 2024Updated 2 years ago