A role-playing game for incident management training
☆195Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for wheel-of-misfortune
Users that are interested in wheel-of-misfortune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- WebAMS is an Open Source web application for reporting and resolving incidents or tickets☆10Dec 11, 2022Updated 3 years ago
- A list of common Disaster Recovery (DR) scenarios for software companies☆35Sep 13, 2021Updated 4 years ago
- A collection of postmortem templates☆1,436Jul 12, 2023Updated 2 years ago
- List Kubernetes objects in a problematic state☆62Aug 26, 2021Updated 4 years ago
- Calculate how much downtime should be permitted in your Service Level Agreement or Objective☆68Feb 14, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A GitHub action that organizes your post-mortems☆17Oct 12, 2022Updated 3 years ago
- ☆12Updated this week
- A collection of Twilio SRE's Gameday Templates☆140Oct 13, 2020Updated 5 years ago
- Grafana's Loki integrated with kube-prometheus monitoring stack☆16Apr 22, 2019Updated 7 years ago
- DEPRECATED Collection of python scripts to run failure injection on AWS infrastructure☆93Oct 18, 2023Updated 2 years ago
- A curated list of Site Reliability and Production Engineering Tools☆1,462May 13, 2026Updated last month
- Terraform Automation and Collaboration tools (TACOS) pricing calculator☆17Aug 14, 2023Updated 2 years ago
- Much resources. So log. Wow.☆24May 17, 2014Updated 12 years ago
- A sample of major outages and incidents☆19Jul 27, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Instruments code for collecting data coverage (instead of code coverage)☆10May 5, 2017Updated 9 years ago
- Fixed it, so that years actually make sense, instead of AD and BC nonsense☆14Mar 21, 2025Updated last year
- Easy setup a service level objective using prometheus☆137Jan 10, 2026Updated 5 months ago
- conference conspects☆21May 25, 2020Updated 6 years ago
- The end to end testing tool for @crossplane providers and configurations.☆28Mar 8, 2026Updated 3 months ago
- Create an incident response triage toolkit for use with Windows or Linux.☆18Jun 14, 2020Updated 5 years ago
- Portable Activity Timeline that draws the Timeline based on data given in JSON or CSV format. By clicking on any activity a detailed moda…☆12Apr 6, 2023Updated 3 years ago
- SLO Generator computes SLIs, SLOs, Error Budgets and Burn Rates from supported backends, then exports an SLO report to supported targets.☆562Jun 3, 2026Updated last week
- A proxy to fan out a single request to multiple prometheus servers and merge the responses☆12Feb 22, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A tool to build custom application simulators through declarative configuration☆11Dec 15, 2025Updated 5 months ago
- Manage application's SLI and SLO's easily with the application lifecycle inside a Kubernetes cluster☆277Jun 11, 2021Updated 5 years ago
- GitOps 101☆16Nov 5, 2019Updated 6 years ago
- Run isolated cookbook tests against your chef repository with Strainer.☆110Feb 16, 2015Updated 11 years ago
- Linux Metrics Workshop☆11Jun 30, 2020Updated 5 years ago
- ☆53Dec 20, 2022Updated 3 years ago
- Text Match Cut Video Generator Web App☆37Feb 19, 2026Updated 3 months ago
- Tools for Chaos Engineers☆44Mar 19, 2018Updated 8 years ago
- Create alerts from messages sent to a Slack channel☆28Apr 7, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository is a curated list of pro bono incident response entities.☆21Jun 21, 2023Updated 2 years ago
- A WIP drop in replacement for Prometheus Alertmanager, built on top of Cloudflare Workers☆25Nov 19, 2025Updated 6 months ago
- Calm monitoring extension for the OpenTelemetry Collector☆13Aug 11, 2025Updated 10 months ago
- ☆19Apr 4, 2018Updated 8 years ago
- A curated list of Site Reliability and Production Engineering resources.☆13,257Aug 28, 2025Updated 9 months ago
- A curated list of Chaos Engineering resources.☆6,580Dec 28, 2023Updated 2 years ago
- An example of a service, which "evolves" step by step☆35Nov 10, 2017Updated 8 years ago