Skip to main content

🧠 SRE Metrics: Availability

Β· 5 min read
PagerTree Team
PagerTree Team

How available is your website, service, or platform? What must you monitor and measure to ensure availability? How do you translate uptime into availability? This chart has numbers that every Site Reliability Engineer (SRE) should know. Below the chart, you will find answers to commonly asked questions about SRE and associated metrics.

UptimeDowntime (Per Year)
99%3 Days : 15 Hours : 39 Minutes
99.9%8 Hour : 45 Minutes : 56 Seconds
99.99%52 Minutes : 35 Seconds
99.999%5 Minutes : 15 Seconds
99.9999%31 Seconds
99.99999%3 Seconds

🧠 The Science of On-Call

Β· 8 min read
Yuan Cheng
Co-Founder of PagerTree

Over the past decade, multiple scientific studies have confirmed what we in DevOps have known for ages: Being on-call is a pain! But just how bad is it?

🧠 Serverless Scales

Β· 4 min read
Austin Miller
Founder of PagerTree

In Part 1: What is Serverless? I talked about how one of the biggest pros to a serverless architecture is how well it scales and how high availability is baked in.

In this post I’ll go over:

  1. How a traditional highly available scalable architecture works
  2. How a scalable serverless architecture works
  3. How you can benefit from a serverless architecture