Resilience engineering papers
☆3,039Jan 20, 2026Updated 2 months ago
Alternatives and similar repositories for resilience-engineering
Users that are interested in resilience-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Systems and failure reading list☆203Dec 30, 2021Updated 4 years ago
- Documents and resources for the "Learning from Incidents in Software" slack workspace.☆40Oct 13, 2020Updated 5 years ago
- Class materials for a distributed systems lecture series☆9,461Mar 18, 2025Updated last year
- A collection of postmortems. Sorry for the delay in merging PRs!☆11,948Mar 7, 2026Updated 2 weeks ago
- ☆140May 30, 2025Updated 9 months ago
- Introduction to resilience engineering concepts for software engineers☆68Sep 1, 2019Updated 6 years ago
- ☆70Oct 21, 2019Updated 6 years ago
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering☆227Nov 2, 2018Updated 7 years ago
- A curated list of Site Reliability and Production Engineering resources.☆13,067Aug 28, 2025Updated 6 months ago
- Notes on David Woods's Resilience Engineering short course☆45Jan 31, 2021Updated 5 years ago
- Examples of OS / system limits☆312Mar 25, 2021Updated 4 years ago
- A curated list of Chaos Engineering resources.☆6,520Dec 28, 2023Updated 2 years ago
- Thoughts on Go performance optimization☆10,894Jan 5, 2022Updated 4 years ago
- 🧠 Laws, Theories, Principles and Patterns for developers and technologists.☆27,020Feb 6, 2026Updated last month
- A curated list of well-written publicly available incident writeups☆13Jul 14, 2019Updated 6 years ago
- For when people get too hyped up about things☆7,346Jan 5, 2024Updated 2 years ago
- Messiness reading list☆56Dec 24, 2021Updated 4 years ago
- Free and Open Source GUI to Visualize Kubernetes Applications.☆1,460Jul 25, 2019Updated 6 years ago
- ☆3,421Feb 9, 2021Updated 5 years ago
- A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliab…☆9,710Nov 17, 2025Updated 4 months ago
- A sample of major outages and incidents☆18Jul 27, 2019Updated 6 years ago
- 📙 Amazon Web Services — a practical guide☆36,588Aug 16, 2024Updated last year
- Compilation of public failure/horror stories related to Kubernetes☆6,212Aug 23, 2020Updated 5 years ago
- Bootstrap Kubernetes the hard way. No scripts.☆47,759Apr 10, 2025Updated 11 months ago
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin…☆567Oct 27, 2019Updated 6 years ago
- Monzo's real-time incident response and reporting tool ⚡️☆1,554Mar 20, 2024Updated 2 years ago
- 😱 Falsehoods Programmers Believe in☆27,185Jan 20, 2026Updated 2 months ago
- 🤔 What happens when I type kubectl run?☆5,081Oct 17, 2023Updated 2 years ago
- A distributed, fault-tolerant pipeline for observability data☆1,742Mar 20, 2024Updated 2 years ago
- A tool for exploring each layer in a docker image☆53,612Dec 15, 2025Updated 3 months ago
- Learn where some of the network sysctl variables fit into the Linux/Kernel network flow. Translations: 🇷🇺☆5,786Feb 20, 2026Updated last month
- Terratest is a Go library that makes it easier to write automated tests for your infrastructure code.☆7,884Updated this week
- Write tests against structured configuration data using the Open Policy Agent Rego query language☆3,142Mar 14, 2026Updated last week
- A collection of postmortem templates☆1,419Jul 12, 2023Updated 2 years ago
- Open-source cloud-environment inspector. Supporting AWS, GCP, Azure, and more! Your cloud resources will have nowhere to hide!☆4,111Mar 5, 2026Updated 2 weeks ago
- ⚙️ A Gentle introduction to Kubernetes with more than just the basics. 🌟 Give it a star if you like it.☆3,231Feb 28, 2023Updated 3 years ago
- A curated list for awesome kubernetes sources☆15,837Feb 27, 2026Updated 3 weeks ago
- Queueing theory: an introduction for software development☆2,196Feb 7, 2026Updated last month
- Heuristics for effective management☆5,494Jan 9, 2026Updated 2 months ago