lorin / resilience-engineeringLinks
Resilience engineering papers
☆3,005Updated 4 months ago
Alternatives and similar repositories for resilience-engineering
Users that are interested in resilience-engineering are comparing it to the libraries listed below
Sorting:
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin…☆565Updated 6 years ago
- Queueing theory: an introduction for software development☆2,177Updated 7 months ago
- Compilation of public failure/horror stories related to Kubernetes☆6,213Updated 5 years ago
- A collection of postmortem templates☆1,401Updated 2 years ago
- ⚙️ A Gentle introduction to Kubernetes with more than just the basics. 🌟 Give it a star if you like it.☆3,236Updated 2 years ago
- ☆3,422Updated 4 years ago
- Monzo's real-time incident response and reporting tool ⚡️☆1,545Updated last year
- Systems and failure reading list☆200Updated 3 years ago
- Class materials for a distributed systems lecture series☆9,235Updated 7 months ago
- A curated list of Chaos Engineering resources.☆6,443Updated last year
- Books for people who are or aspire to manage/lead team(s) of software engineers☆1,665Updated last year
- Docker, Kubernetes and Gravity Trainings by Gravitational☆2,028Updated 2 years ago
- Ways of Working (WoW) with team principles, values, tenets, ground rules, aspirations, norms, working agreements, shared expectations, an…☆709Updated 4 months ago
- Notes on David Woods's Resilience Engineering short course☆42Updated 4 years ago
- A reading list for services engineering, with a focus on cloud infrastructure services☆3,663Updated 3 years ago
- A collection of postmortems. Sorry for the delay in merging PRs!☆11,753Updated 2 months ago
- Curated list of resources on testing distributed systems☆2,591Updated last week
- Chaos Engineering Toolkit & Orchestration for Developers☆1,972Updated last year
- The Open Architecture Playbook. Use it to create better and faster (IT)Architectures. OSS Tools, templates and more for solving IT proble…☆732Updated 2 months ago
- Ideas for creating and sustaining high performance organizations☆1,259Updated last month
- Prometheus-Basics is part of Prometheus Docs now, checkout 👇☆1,570Updated 4 years ago
- repositories of my talks☆278Updated 5 years ago
- post mortem tracker☆1,020Updated 6 years ago
- Practical introduction to Prometheus for developers.☆472Updated last year
- For when people get too hyped up about things☆7,340Updated last year
- A developer's guide to management: an open-sourced handbook for leading software engineering teams.☆1,563Updated 5 years ago
- A distributed, fault-tolerant pipeline for observability data☆1,740Updated last year
- Techniques and numbers for estimating system's performance from first-principles☆4,646Updated last year
- systems is a set of tools for describing, running and visualizing systems diagrams.☆396Updated 6 months ago
- Run Book / Operations Manual template for modern software systems☆716Updated 6 years ago