thishitshome / learning-from-incidents
☆69 Updated 5 years ago
Alternatives and similar repositories for learning-from-incidents:
Users who are interested in learning-from-incidents are comparing it to the repositories listed below.
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering ☆227 Updated 6 years ago
- Roll all instances within a Kubernetes cluster, using a zero-downtime strategy. ☆51 Updated 3 years ago
- Build and deploy K8Guard. Run all Make commands from this repo. ☆136 Updated 7 months ago
- Instant ephemeral Kubernetes clusters for development and testing ☆118 Updated 2 years ago
- Documents and resources for the "Learning from Incidents in Software" Slack workspace. ☆39 Updated 4 years ago
- Enable RBAC profiles for Tiller ☆59 Updated 6 years ago
- A tool for managing complex enterprise Kubernetes environments as code. ☆45 Updated this week
- Dogscaler scales up AWS autoscale groups based on the results of a Datadog query. ☆16 Updated last week
- ☆44 Updated 6 years ago
- A collection of Twilio SRE's Gameday Templates ☆140 Updated 4 years ago
- In-Cluster templating for Kubernetes manifests ☆69 Updated 4 years ago
- kubeplay – a new way to interact with Kubernetes API from your terminal ☆86 Updated 6 years ago
- List of companies/organizations running Kubernetes on AWS ☆110 Updated 5 years ago
- A Chaos Engineering Bootcamp ☆171 Updated 6 years ago
- Running Terraform in Kubernetes as a controller ☆88 Updated 7 years ago
- Terraform InSpec Provisioner Plugin ☆68 Updated 6 years ago
- Messiness reading list ☆57 Updated 3 years ago
- KubeSanity, a sanity-checking framework for Kubernetes ☆84 Updated 7 years ago
- A CLI tool providing you with status & configuration of a Kubernetes cluster fleet ☆109 Updated 4 months ago
- Sidecar container for requesting dynamic Vault database secrets ☆83 Updated 3 weeks ago
- Code accompanying my talk at HashiDays New York, 2017 ☆96 Updated 6 years ago
- Literature Review for Fault Detection in Distributed Systems ☆60 Updated 7 years ago
- A curated list of well-written publicly available incident writeups ☆13 Updated 5 years ago
- Nginx-based Kubernetes ingress controller for AWS ☆58 Updated last year
- Randomly delete pods in a given namespace ☆86 Updated 4 years ago
- A repo of links to articles, papers, conference talks, and tooling related to load management in software services. ☆191 Updated last year
- A very basic showcase of an operational platform for your microservices ☆53 Updated 8 years ago
- k8s-trailhead is intended to be a starting point for any engineers who are interested in interacting with Kubernetes via the Golang clien… ☆16 Updated 7 years ago
- Expose AWS service usage and limits to Prometheus ☆47 Updated 11 months ago
- Automate management of Kubernetes HPAs for Deployments & ReplicationControllers ☆12 Updated 8 years ago