lorin / awesome-limits
Examples of OS / system limits
☆308Updated 3 years ago
Alternatives and similar repositories for awesome-limits:
Users that are interested in awesome-limits are comparing it to the libraries listed below
- Systems and failure reading list☆198Updated 3 years ago
- A repo of links to articles, papers, conference talks, and tooling related to load management in software services.☆193Updated last year
- A collection of the papers, conference talks, articles, blog posts, interesting Twitter threads, HN/reddit comments on systems engineerin…☆568Updated 5 years ago
- This is a collection of readings, talks, and other bits regarding the field of Resilience Engineering☆227Updated 6 years ago
- A role-playing game for incident management training☆172Updated last year
- ☆69Updated 5 years ago
- Introduction to resilience engineering concepts for software engineers☆69Updated 5 years ago
- A curated list of cheatsheets for SRE☆199Updated 4 years ago
- A Kubernetes node connectivity monitoring tool☆288Updated 11 months ago
- A collection of Twilio SRE's Gameday Templates☆140Updated 4 years ago
- Curated list of resources on SLOs☆271Updated 5 months ago
- The Open Source Observability Distribution☆1,221Updated 3 years ago
- Legend builds and publishes Grafana dashboards for your services with prefilled metrics and alerts for your services.☆184Updated 2 years ago
- Most recent content for the deck of cards☆117Updated 3 years ago
- SLOs, Error windows and alerts are complicated. Here an attempt to make it easy☆130Updated last week
- A tool to track SLA, SLO and Error budgets☆389Updated 2 years ago
- mimic: Define your Deployments, Infrastructure and Configuration as a Go Code 🚀☆237Updated last year
- A small helper to generate Honeycomb traces from CI builds☆222Updated last week
- Practical introduction to Prometheus for developers.☆454Updated 11 months ago
- Notes on Site Reliability Engineering. Leave a 🌟 if you found this useful!☆204Updated 2 years ago
- Awesome list of distributed transactions☆728Updated 3 years ago
- ☆130Updated 2 years ago
- Automatically removes Cloud managed services and Kubernetes resources based on tags with TTL☆221Updated last month
- A curated list of well-written publicly available incident writeups☆13Updated 5 years ago
- The curriculum for apprentice-level engineers at FairWinds Ops☆95Updated 3 years ago
- Interpret traceroute output to show names of ASN traversed☆154Updated 4 years ago
- A collection of step by step guides for fixing common tech problems.☆128Updated 2 years ago
- A best practices checker for Kubernetes clusters. 🤠☆556Updated last week
- Exception Monitoring Service☆188Updated 2 years ago
- Prometheus rule linter/validator☆905Updated this week