lorin / resilience-engineering
Resilience engineering papers
☆2,980 · Updated 3 weeks ago
Alternatives and similar repositories for resilience-engineering:
Users who are interested in resilience-engineering are comparing it to the repositories listed below:
- A Gentle introduction to Kubernetes with more than just the basics. Give it a star if you like it. ☆3,230 · Updated 2 years ago
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin… ☆568 · Updated 5 years ago
- ☆3,428 · Updated 4 years ago
- Compilation of public failure/horror stories related to Kubernetes ☆6,224 · Updated 4 years ago
- A curated list of Chaos Engineering resources. ☆6,217 · Updated last year
- Queueing theory: an introduction for software development ☆2,149 · Updated this week
- A collection of postmortem templates ☆1,346 · Updated last year
- A collection of postmortems. Sorry for the delay in merging PRs! ☆11,462 · Updated last month
- Curated list of resources on testing distributed systems ☆2,555 · Updated 3 weeks ago
- Class materials for a distributed systems lecture series ☆9,160 · Updated 3 weeks ago
- Docker, Kubernetes and Gravity Trainings by Gravitational ☆2,022 · Updated last year
- Pointers and tools for learning and day-to-day practice of engineering management & leadership. ☆2,327 · Updated 4 months ago
- A collection of debugging stories. PRs welcome (sorry for the backlog) :-) ☆3,797 · Updated 10 months ago
- A reading list for services engineering, with a focus on cloud infrastructure services ☆3,641 · Updated 2 years ago
- A workbench for writing toy implementations of distributed systems. ☆3,227 · Updated 2 months ago
- For when people get too hyped up about things ☆7,285 · Updated last year
- The Open Architecture Playbook. Use it to create better and faster (IT)Architectures. OSS Tools, templates and more for solving IT proble… ☆722 · Updated last month
- Mega list of 1 on 1 meeting questions compiled from a variety to sources ☆9,569 · Updated 2 years ago
- FlameScope is a visualization tool for exploring different time ranges as Flame Graphs. ☆3,050 · Updated last year
- A distributed, fault-tolerant pipeline for observability data ☆1,741 · Updated last year
- A Mighty CLI for AWS ☆4,978 · Updated 2 years ago
- Chaos Engineering Toolkit & Orchestration for Developers ☆1,918 · Updated 8 months ago
- What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolati… ☆2,553 · Updated 6 months ago
- High level architecture overview ☆1,451 · Updated 2 months ago
- Monzo's real-time incident response and reporting tool ⚡️ ☆1,539 · Updated last year
- A curated list of Site Reliability and Production Engineering resources. ☆12,305 · Updated 10 months ago
- The Open Source Observability Distribution ☆1,222 · Updated 3 years ago
- Book Recommendations for the Infrastructure Engineer ;) ☆924 · Updated 3 years ago
- Prometheus-Basics is part of Prometheus Docs now, check it out. ☆1,552 · Updated 4 years ago
- Awesome list of distributed transactions ☆727 · Updated 3 years ago