lorin / resilience-engineering
Resilience engineering papers
☆3,002 Updated 3 months ago
Alternatives and similar repositories for resilience-engineering
Users who are interested in resilience-engineering are comparing it to the repositories listed below.
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin… ☆567 Updated 5 years ago
- A collection of postmortem templates ☆1,389 Updated 2 years ago
- Compilation of public failure/horror stories related to Kubernetes ☆6,215 Updated 5 years ago
- Curated list of resources on testing distributed systems ☆2,573 Updated last month
- Systems and failure reading list ☆200 Updated 3 years ago
- Queueing theory: an introduction for software development ☆2,172 Updated 5 months ago
- Docker, Kubernetes and Gravity Trainings by Gravitational ☆2,029 Updated 2 years ago
- ☆3,429 Updated 4 years ago
- ⚙️ A Gentle introduction to Kubernetes with more than just the basics. 🌟 Give it a star if you like it. ☆3,230 Updated 2 years ago
- Prometheus-Basics is part of Prometheus Docs now, checkout 👇 ☆1,571 Updated 4 years ago
- Class materials for a distributed systems lecture series ☆9,221 Updated 6 months ago
- Ideas for creating and sustaining high performance organizations ☆1,257 Updated last week
- A reading list for services engineering, with a focus on cloud infrastructure services ☆3,662 Updated 3 years ago
- A developer's guide to management: an open-sourced handbook for leading software engineering teams. ☆1,565 Updated 5 years ago
- For when people get too hyped up about things ☆7,295 Updated last year
- Chaos Engineering Toolkit & Orchestration for Developers ☆1,959 Updated last year
- A curated list of Chaos Engineering resources. ☆6,398 Updated last year
- A collection of postmortems. Sorry for the delay in merging PRs! ☆11,681 Updated last month
- systems is a set of tools for describing, running and visualizing systems diagrams. ☆394 Updated 4 months ago
- The Open Architecture Playbook. Use it to create better and faster (IT)Architectures. OSS Tools, templates and more for solving IT proble… ☆730 Updated 3 weeks ago
- Monzo's real-time incident response and reporting tool ⚡️ ☆1,546 Updated last year
- A distributed, fault-tolerant pipeline for observability data ☆1,742 Updated last year
- Techniques and numbers for estimating system's performance from first-principles ☆4,587 Updated last year
- A collection of engineering ladders for reference and inspiration ☆588 Updated last year
- A collection of debugging stories. PRs welcome (sorry for the backlog) :-) ☆3,816 Updated last year
- Heuristics for effective management ☆5,391 Updated last year
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering ☆226 Updated 6 years ago
- repositories of my talks ☆278 Updated 5 years ago
- Ways of Working (WoW) with team principles, values, tenets, ground rules, aspirations, norms, working agreements, shared expectations, an… ☆704 Updated 3 months ago
- Books for people who are or aspire to manage/lead team(s) of software engineers ☆1,653 Updated last year