lorin / resilience-engineering
Resilience engineering papers
☆2,984 Updated last month
Alternatives and similar repositories for resilience-engineering:
Users who are interested in resilience-engineering are comparing it to the repositories listed below
- Compilation of public failure/horror stories related to Kubernetes ☆6,223 Updated 4 years ago
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin… ☆568 Updated 5 years ago
- Queueing theory: an introduction for software development ☆2,155 Updated 3 weeks ago
- A curated list of Chaos Engineering resources. ☆6,246 Updated last year
- ⚙️ A Gentle introduction to Kubernetes with more than just the basics. 🌟 Give it a star if you like it. ☆3,229 Updated 2 years ago
- ☆3,426 Updated 4 years ago
- Class materials for a distributed systems lecture series ☆9,178 Updated last month
- Ideas for creating and sustaining high performance organizations ☆1,246 Updated last month
- Curated list of resources on testing distributed systems ☆2,559 Updated this week
- Books for people who are or aspire to manage/lead team(s) of software engineers ☆1,618 Updated last year
- Prometheus-Basics is part of Prometheus Docs now, checkout 👇 ☆1,553 Updated 4 years ago
- A collection of postmortems. Sorry for the delay in merging PRs! ☆11,508 Updated this week
- Heuristics for effective management ☆5,365 Updated 9 months ago
- A reading list for services engineering, with a focus on cloud infrastructure services ☆3,643 Updated 2 years ago
- A curated list of Site Reliability and Production Engineering resources. ☆12,345 Updated 10 months ago
- Techniques and numbers for estimating system's performance from first-principles ☆4,260 Updated 7 months ago
- For when people get too hyped up about things ☆7,285 Updated last year
- The Open Architecture Playbook. Use it to create better and faster (IT)Architectures. OSS Tools, templates and more for solving IT proble… ☆722 Updated last month
- A distributed, fault-tolerant pipeline for observability data ☆1,743 Updated last year
- Learn where some of the network sysctl variables fit into the Linux/Kernel network flow. Translations: 🇷🇺 ☆5,641 Updated 2 months ago
- Chaos Engineering Toolkit & Orchestration for Developers ☆1,925 Updated 9 months ago
- Monzo's real-time incident response and reporting tool ⚡️ ☆1,542 Updated last year
- Brad's homelab setup ☆1,905 Updated 5 years ago
- Debugging tool for Kubernetes which tests and displays connectivity between nodes in the cluster. ☆2,598 Updated 5 months ago
- ☆1,997 Updated 2 years ago
- A Mighty CLI for AWS ☆4,978 Updated 2 years ago
- Architectural patterns of resilient distributed systems ☆1,258 Updated 7 years ago
- Systems and failure reading list ☆198 Updated 3 years ago
- High level architecture overview ☆1,452 Updated 2 weeks ago
- Command-line tools for working with Architecture Decision Records ☆4,897 Updated last year