Run Book / Operations Manual template for modern software systems
☆720Aug 21, 2019Updated 6 years ago
Alternatives and similar repositories for run-book-template
Users that are interested in run-book-template are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Questions to assess the operability of software systems☆17Apr 4, 2019Updated 7 years ago
- Questions to assess the testability of software systems☆20Oct 24, 2019Updated 6 years ago
- A set of Grafana dashboards and Prometheus alerts for Kubernetes.☆2,421Jun 12, 2026Updated last week
- A collection templates ported from the SRE Workbook☆43Aug 24, 2018Updated 7 years ago
- The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different te…☆210Aug 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of Site Reliability and Production Engineering resources.☆13,272Aug 28, 2025Updated 9 months ago
- ☆17Sep 27, 2022Updated 3 years ago
- Manage application's SLI and SLO's easily with the application lifecycle inside a Kubernetes cluster☆277Jun 11, 2021Updated 5 years ago
- A simple template for a wiki page for a TVP (thinnest viable platform) - as explained in the Team Topologies book☆65Mar 5, 2021Updated 5 years ago
- Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.☆9,651Jun 12, 2026Updated last week
- Write tests against structured configuration data using the Open Policy Agent Rego query language☆3,203Updated this week
- Kubediff: a tool for Kubernetes to show differences between running state and version controlled configuration.☆1,181Oct 24, 2023Updated 2 years ago
- Validation of best practices in your Kubernetes clusters☆3,367May 19, 2026Updated last month
- A powerful testing tool for Kubernetes clusters.☆1,975Nov 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A collection of step by step guides for fixing common tech problems.☆128Feb 13, 2023Updated 3 years ago
- PagerDuty's Incident Response Documentation.☆1,046Apr 9, 2026Updated 2 months ago
- Tips and tricks for getting through on-call☆401Jun 13, 2020Updated 6 years ago
- Backup and migrate Kubernetes applications and their persistent volumes☆10,061Jun 12, 2026Updated last week
- Validate your Kubernetes configuration files, supports multiple Kubernetes versions☆3,227Jan 29, 2026Updated 4 months ago
- Terratest is a Go library that makes it easier to write automated tests for your infrastructure code.☆7,929Updated this week
- Jsonnet library for generating Grafana dashboard files.☆1,076Jun 5, 2026Updated 2 weeks ago
- 🦥 Easy and simple Prometheus SLO (service level objectives) generator☆2,501Jun 8, 2026Updated last week
- post mortem tracker☆1,018Oct 1, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Generate documentation from Terraform modules in various output formats☆4,788Jun 10, 2026Updated last week
- A collection of postmortem templates☆1,436Jul 12, 2023Updated 2 years ago
- 👀 A Kubernetes cluster resource sanitizer☆6,303Dec 8, 2025Updated 6 months ago
- Website for the DevOps Team Topologies at devopstopologies.com☆98Mar 21, 2023Updated 3 years ago
- A logging library designed with operations in mind.☆12Dec 6, 2017Updated 8 years ago
- Chaos Engineering Toolkit & Orchestration for Developers☆2,013Jul 20, 2024Updated last year
- A list of common Disaster Recovery (DR) scenarios for software companies☆35Sep 13, 2021Updated 4 years ago
- jsonnet library to patch objects loaded from yaml☆17Oct 24, 2023Updated 2 years ago
- Guidance on how to make your environment easier to onboard for Web Ops Engineers, SRE's and DevOps Practitioners☆293Jul 18, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- S3 Log Streamer (S3 log forwarder) aims at facilitating the extraction of real time s3 log published and stream it to tcp clients, syslog…☆11May 16, 2016Updated 10 years ago
- Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resour…☆6,007Jun 10, 2026Updated last week
- A wrapper to send shell command results to sensu☆21Oct 3, 2022Updated 3 years ago
- Open specification for defining and expressing service level objectives (SLO)☆1,496Nov 25, 2025Updated 6 months ago
- Quick and Easy server testing/validation☆5,904Jun 8, 2026Updated last week
- Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes☆9,936Jun 12, 2026Updated last week
- Monzo's real-time incident response and reporting tool ⚡️☆1,556Mar 20, 2024Updated 2 years ago