lorin / res-eng-short-course-notes
Notes on David Woods's Resilience Engineering short course
☆42 · Updated 3 years ago
Alternatives and similar repositories for res-eng-short-course-notes:
Users who are interested in res-eng-short-course-notes are comparing it to the repositories listed below
- Documents and resources for the "Learning from Incidents in Software" Slack workspace. ☆39 · Updated 4 years ago
- Leading Groups at Etsy to Learn From Accidents ☆250 · Updated 6 months ago
- Systems and failure reading list ☆197 · Updated 3 years ago
- Definitions of a set of -ilities. ☆94 · Updated 6 years ago
- Literature Review for Fault Detection in Distributed Systems ☆60 · Updated 7 years ago
- Messiness reading list ☆57 · Updated 3 years ago
- Tech Maturity measures and tracks the maturity of software over time ☆97 · Updated last year
- ☆70 · Updated 5 years ago
- A 1-day training class on how to deploy a cloud native app on AWS with Terraform and shell scripts ☆141 · Updated 8 years ago
- ☆45 · Updated 5 years ago
- A Chaos Engineering Bootcamp ☆171 · Updated 6 years ago
- repositories of my talks ☆280 · Updated 5 years ago
- A sample of major outages and incidents ☆18 · Updated 5 years ago
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering ☆227 · Updated 6 years ago
- Images and links to references for Kafka Fault Tree Analysis talks by Andrey Falko ☆21 · Updated 3 years ago
- Chaos Engineering Working Group ☆113 · Updated 4 years ago
- Open Chaos Initiative ☆31 · Updated 4 years ago
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin… ☆568 · Updated 5 years ago
- DEPRECATED. I managed to squeeze another year out of this, but moving on to a Docker platform in Calavera2. [This is a project to create… ☆34 · Updated 7 years ago
- Growing up tech leads: notes on running a skill share for your peers. ☆180 · Updated 6 years ago
- The Software Defined Delivery Manifesto ☆133 · Updated 5 years ago
- How-to guide for testing the riff FaaS platform and Istio on Google Kubernetes Engine. ☆97 · Updated 6 years ago
- High availability reading list ☆50 · Updated 6 years ago
- Slides and resources for talks on partition tolerance ☆33 · Updated 6 years ago
- Slide decks with editable source files ☆232 · Updated 5 months ago
- Keynote for QCon SF 2015! ☆38 · Updated 8 years ago
- Introduction to resilience engineering concepts for software engineers ☆70 · Updated 5 years ago
- A curated list of well-written publicly available incident writeups ☆13 · Updated 5 years ago
- A troposphere-inspired library for programmatic, declarative definition and management of SignalFx Charts, Dashboards, and Detectors. ☆40 · Updated last year
- Information to run your own Chaos Day ☆12 · Updated 5 years ago