lorin / resilience-engineeringLinks
Resilience engineering papers
☆2,997Updated last month
Alternatives and similar repositories for resilience-engineering
Users that are interested in resilience-engineering are comparing it to the libraries listed below
Sorting:
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin…☆568Updated 5 years ago
- ⚙️ A Gentle introduction to Kubernetes with more than just the basics. 🌟 Give it a star if you like it.☆3,229Updated 2 years ago
- Queueing theory: an introduction for software development☆2,161Updated 3 months ago
- Compilation of public failure/horror stories related to Kubernetes☆6,215Updated 4 years ago
- A collection of postmortem templates☆1,376Updated 2 years ago
- Class materials for a distributed systems lecture series☆9,206Updated 4 months ago
- For when people get too hyped up about things☆7,289Updated last year
- A reading list for services engineering, with a focus on cloud infrastructure services☆3,655Updated 2 years ago
- Ideas for creating and sustaining high performance organizations☆1,253Updated 4 months ago
- Monzo's real-time incident response and reporting tool ⚡️☆1,544Updated last year
- Books for people who are or aspire to manage/lead team(s) of software engineers☆1,649Updated last year
- Prometheus-Basics is part of Prometheus Docs now, checkout 👇☆1,569Updated 4 years ago
- ☆3,427Updated 4 years ago
- Docker, Kubernetes and Gravity Trainings by Gravitational☆2,027Updated 2 years ago
- Curated list of resources on testing distributed systems☆2,562Updated 3 months ago
- Ways of Working (WoW) with team principles, values, tenets, ground rules, aspirations, norms, working agreements, shared expectations, an…☆698Updated last month
- Senior Engineer CheckList☆542Updated 3 years ago
- A collection of postmortems. Sorry for the delay in merging PRs!☆11,599Updated 3 months ago
- Mega list of 1 on 1 meeting questions compiled from a variety to sources☆9,587Updated 2 years ago
- Heuristics for effective management☆5,383Updated last year
- A distributed, fault-tolerant pipeline for observability data☆1,743Updated last year
- A collection of engineering ladders for reference and inspiration☆580Updated last year
- Jari's collection of interesting papers.☆494Updated last week
- A curated list of Chaos Engineering resources.☆6,341Updated last year
- A developer's guide to management: an open-sourced handbook for leading software engineering teams.☆1,567Updated 5 years ago
- Practical introduction to Prometheus for developers.☆456Updated last year
- A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliab…☆9,402Updated last week
- A curated list of Site Reliability and Production Engineering resources.☆12,547Updated last year
- learn awk by example☆740Updated 4 years ago
- Architectural patterns of resilient distributed systems☆1,258Updated 8 years ago