adaptivecapacitylabs / Resilience-Engineering-ResourcesLinks
This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering
☆226Updated 6 years ago
Alternatives and similar repositories for Resilience-Engineering-Resources
Users that are interested in Resilience-Engineering-Resources are comparing it to the libraries listed below
Sorting:
- repositories of my talks☆278Updated 5 years ago
- ☆70Updated 5 years ago
- Growing up tech leads: notes on running a skill share for your peers.☆179Updated 6 years ago
- A Chaos Engineering Bootcamp☆173Updated 7 years ago
- The accompanying repository for The ScalingStatefulServices talk☆127Updated 9 years ago
- The Agile Operations methodology☆146Updated 2 years ago
- A 1-day training class on how to deploy a cloud native app on AWS with Terraform and shell scripts☆141Updated 8 years ago
- Literature Review for Fault Detection in Distributed Systems☆60Updated 8 years ago
- Tips and tricks for getting through on-call☆400Updated 5 years ago
- Messiness reading list☆57Updated 3 years ago
- Requests for Discussion☆264Updated 3 weeks ago
- Systems and failure reading list☆200Updated 3 years ago
- Catalog of valuable metrics you might want to collect☆346Updated 11 years ago
- A checklist for ensuring that code reviews are thorough☆45Updated 6 years ago
- Leading Groups at Etsy to Learn From Accidents☆255Updated last year
- Interesting and useful containers usages☆195Updated 2 years ago
- High availability reading list☆50Updated 6 years ago
- Reference materials to this talk☆30Updated 8 years ago
- A tool for performing actions on GitHub repos or a single repo.☆357Updated 2 years ago
- Tech Maturity measures and tracks the maturity of software over time☆98Updated 2 years ago
- Configs and scripts for bootstrapping an opinionated Kubernetes cluster anywhere.☆396Updated 6 years ago
- For auditing what collaborators, hooks, and deploy keys you have added on all your GitHub repositories.☆335Updated 5 years ago
- Examples of OS / system limits☆308Updated 4 years ago
- A very basic showcase of an operational platform for your microservices☆53Updated 9 years ago
- Random, repeatable network fault injection☆102Updated 11 years ago
- Documents and resources for the "Learning from Incidents in Software" slack workspace.☆40Updated 4 years ago
- An API and collection system to centralize important AWS resource information across multiple accounts☆87Updated 5 years ago
- CrashCart: sideload binaries into a running container☆272Updated 7 years ago
- HumanOps deliberately highlights the importance of the teams running systems, not just the systems themselves.☆269Updated 4 years ago
- A searchable EC2 Inventory store☆98Updated 5 years ago