Notes on Site Reliability Engineering. Leave a π if you found this useful!
β205Jul 19, 2022Updated 3 years ago
Alternatives and similar repositories for SiteReliabilityEngineering
Users that are interested in SiteReliabilityEngineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Things you should probably know to be a decent Site Reliability Engineer.β16Mar 28, 2025Updated last year
- A collection templates ported from the SRE Workbookβ44Aug 24, 2018Updated 7 years ago
- β150Nov 22, 2023Updated 2 years ago
- A curated list of Site Reliability and Production Engineering resources.β13,205Aug 28, 2025Updated 8 months ago
- A collection of questions to practice with for SRE interviewsβ1,003Jan 25, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repository includes resources which are more than sufficient to prepare for google interview if you are applying for a software engiβ¦β738Aug 18, 2022Updated 3 years ago
- Site Reliability Engineer Interview Preparation Guideβ8,918Dec 16, 2025Updated 5 months ago
- Golang SRE framework for logs, metrics, traces and events. It supports: Jaeger, Prometheus, DataDog, Opentelemetry, NewRelic, Grafanaβ18Mar 18, 2026Updated 2 months ago
- Use Github for your SSH AuthorizedKeysCommandβ12Jul 18, 2025Updated 10 months ago
- Google Site Reliability Engineering book converted in audioβ171Mar 22, 2017Updated 9 years ago
- A curated list of Site Reliability and Production Engineering Toolsβ1,458May 13, 2026Updated last week
- A complete study plan to become a Site Reliability Engineer.β1,480Oct 4, 2022Updated 3 years ago
- Curated list of good SRE interview questions.β400Aug 16, 2022Updated 3 years ago
- devops/SRE interview questionsβ97Feb 7, 2026Updated 3 months ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This GitHub repository contains a comprehensive tutorial on Site Reliability Engineering (SRE), covering topics such as SLAs, SLOs, SLIs,β¦β18Mar 23, 2025Updated last year
- Go Application Development: Tips, Tricks, and Techniques [Video], by Packt Publishingβ19Jan 30, 2023Updated 3 years ago
- A collection of SRE toolsβ63Nov 27, 2019Updated 6 years ago
- An end to end example of implementing SLOs with prometheus, grafana and Go.β142May 30, 2019Updated 6 years ago
- β14May 1, 2018Updated 8 years ago
- Genetic algorithm to solve np-complete maximization problems. Originally intended for fantasy sports.β10May 14, 2017Updated 9 years ago
- A collection of postmortem templatesβ1,436Jul 12, 2023Updated 2 years ago
- π Index for my study topicsβ63Aug 28, 2023Updated 2 years ago
- β12May 20, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repo for post, 'Baking AWS AMI with new Docker CE Using Packer'β27Oct 8, 2017Updated 8 years ago
- A vocabulary collection for SREsβ238May 15, 2022Updated 4 years ago
- windows module development for ansibleβ13Jun 26, 2015Updated 10 years ago
- Automatically self-serviced applications for ArgoCD.β25Mar 31, 2022Updated 4 years ago
- Credential classes to access Kubernetes clustersβ17Jan 1, 2026Updated 4 months ago
- A tool to track SLA, SLO and Error budgetsβ401Jan 30, 2023Updated 3 years ago
- Use Terraform to bootstrap AWS infrastructure for a Python appβ18Mar 16, 2018Updated 8 years ago
- Random SRE interview prep materialβ20Sep 21, 2020Updated 5 years ago
- An AWS Lambda Function to automate ECR cleanupβ17Sep 13, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A role-playing game for incident management trainingβ193Feb 27, 2024Updated 2 years ago
- Terraform script for production grade eks provisioningβ16Mar 27, 2020Updated 6 years ago
- A curated list of Chaos Engineering resources.β6,562Dec 28, 2023Updated 2 years ago
- Becoming a Rockstar SRE, published by packtβ42May 2, 2024Updated 2 years ago
- A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliabβ¦β9,724Nov 17, 2025Updated 6 months ago
- High throughout, unsampled tracing span buffer with streaming searchβ40Oct 13, 2021Updated 4 years ago
- sceptre wordpress configured HA mode in AWSβ10Aug 11, 2021Updated 4 years ago