michaelkkehoe / awesome-sre-cheatsheets
A curated list of cheatsheets for SRE
☆197 Updated 3 years ago
Related projects
Alternatives and complementary repositories for awesome-sre-cheatsheets
- A vocabulary collection for SREs ☆203 Updated 2 years ago
- A repo of links to articles, papers, conference talks, and tooling related to load management in software services. ☆190 Updated last year
- Notes on Site Reliability Engineering. Leave a 🌟 if you found this useful! ☆200 Updated 2 years ago
- The curriculum for apprentice-level engineers at FairWinds Ops ☆95 Updated 3 years ago
- What to Read to Learn More About DevOps ☆453 Updated 2 years ago
- Curated list of resources on SLOs ☆268 Updated last month
- A collection of questions to practice with for SRE interviews ☆919 Updated 2 years ago
- A Chaos Engineering Bootcamp ☆170 Updated 6 years ago
- FB preparation materials ☆93 Updated 6 years ago
- A reading and viewing list for larval stage SREs and sysadmins ☆549 Updated 2 weeks ago
- Challenges Your Kubernetes Skills And Knowledge ☆231 Updated 5 years ago
- A list of some of the questions I've had to know during Linux sysadmin / DevOps interviews. ☆147 Updated 8 years ago
- The Agile Operations methodology ☆142 Updated last year
- A collection of Twilio SRE's Gameday Templates ☆141 Updated 4 years ago
- Google Site Reliability Engineering book converted to audio ☆161 Updated 7 years ago
- ☆148 Updated 11 months ago
- Guidance on how to make your environment easier to onboard for Web Ops Engineers, SREs, and DevOps Practitioners ☆280 Updated 4 months ago
- Notes from the book Kubernetes Up and Running ☆98 Updated 6 years ago
- SLOs, error windows, and alerts are complicated. Here is an attempt to make them easy. ☆131 Updated last year
- PromQL cheat sheet - Usage and examples of basics, aggregations & functions ☆57 Updated last year
- Examples of OS / system limits ☆294 Updated 3 years ago
- A collection of step by step guides for fixing common tech problems. ☆129 Updated last year
- Sample code for the talk "How to test your infrastructure code: automated testing for Terraform, Docker, Packer, Kubernetes, and more" by… ☆186 Updated last year
- My opinionated list of products and tools used for high-scalability projects ☆66 Updated last year
- ☆109 Updated 2 years ago
- Legend builds and publishes Grafana dashboards for your services with prefilled metrics and alerts. ☆184 Updated last year
- Curated list of good SRE interview questions. ☆373 Updated 2 years ago
- A library of rules for Conftest used to detect misconfigurations within Terraform configuration files ☆190 Updated 2 years ago