{"id":744,"date":"2014-05-23T13:46:18","date_gmt":"2014-05-23T20:46:18","guid":{"rendered":"http:\/\/itiltopia.com\/?p=744"},"modified":"2017-03-02T21:50:30","modified_gmt":"2017-03-03T05:50:30","slug":"availability-vs-reliability","status":"publish","type":"post","link":"http:\/\/itiltopia.com\/?p=744","title":{"rendered":"Availability vs Reliability"},"content":{"rendered":"<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\"><a href=\"http:\/\/itiltopia.com\/wp-content\/uploads\/2014\/05\/timexz.jpg\"><img loading=\"lazy\" class=\"alignright size-full wp-image-751\" src=\"http:\/\/itiltopia.com\/wp-content\/uploads\/2014\/05\/timexz.jpg\" alt=\"Timex\" width=\"400\" height=\"374\" srcset=\"http:\/\/itiltopia.com\/wp-content\/uploads\/2014\/05\/timexz.jpg 400w, http:\/\/itiltopia.com\/wp-content\/uploads\/2014\/05\/timexz-300x280.jpg 300w\" sizes=\"(max-width: 400px) 100vw, 400px\" \/><\/a>IT prides itself on ensuring that our services are highly available. We measure availability to ridiculously high levels of precision. Indeed, we have a magical number that represents availability nirvana. We call it \u201cFive Nines\u201d.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">The term Five Nines indicates that our systems are available for 99.999% of the agreed time. <\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">Notice that I said \u201cagreed time\u201d. If our users agree to a planned outage of 120 hours for system maintenance every week, meeting that Five Nines of availability becomes much easier to achieve. Of course, generally our users don\u2019t give us the privilege of taking systems down for extended periods on a frequent basis, so we have to eek out a few hours here and there to do our periodic house-keeping.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">To give you an example of how reliable a system that meets Five Nines of availability is, if you were to calculate the downtime for a service that is agreed to be up 24 hours a day, 7 days a week, the total unplanned\u00a0outage time over the course of a year would need to be less than 5 minutes and 15 seconds. That is seriously expensive to do.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">Some organizations don\u2019t allow for even a single moment of downtime because their systems are so heavily relied upon. They go to extreme measures to ensure the highest availability that money can buy. I know of one organization that claims it loses\u00a0a million dollars in revenue for every minute their most critical system goes down. That company\u00a0is American Express, and the system would be their credit card transaction processing service.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">Fortunately, most of us don\u2019t work for companies that have such stringent standards for high availability, but that doesn\u2019t mean we don\u2019t feel the heat when the services do fail.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">ITIL teaches us that there are two aspects we need to measure when addressing the business\u2019 requirements:<\/span><\/span><\/span><\/p>\n<ul>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">Availability or Uptime &#8211; Typically expressed as a % of time (e.g., \u201cWe were up 99.5% of the agreed service time!\u201d)<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">Reliability or Frequency of Outage &#8211; Rarely measured or even mentioned in the Service Level documentation<\/span><\/span><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">And the sad thing is &#8211; users care more about Reliability. Let\u2019s look at two examples. Let\u2019s assume that we have an agreed service time over the course of a month of 43,200 minutes (30*24*60). <\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">In January we had 2 outages:<\/span><\/span><\/span><\/p>\n<ul>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">1\/15 &#8211; Down for 24 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">1\/22 &#8211; Down for 36 minutes<\/span><\/span><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">In February we had 10 outages:<\/span><\/span><\/span><\/p>\n<ul>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/3 &#8211; Down for 4 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/5 &#8211; Down for 8 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/7 &#8211; Down for 6 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/12 &#8211; Down for 2 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/15 &#8211; Down for 9 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/20 &#8211; Down for 8 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/22 &#8211; Down for 5 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/24 &#8211; Down for 3 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/26 &#8211; Down for 9 minutes<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">2\/28 &#8211; Down for 6 minutes<\/span><\/span><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">The Availability calculations for the two months are exactly equal:\u00a0 99.861% Uptime.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">But the Reliability for the two months is dramatically different: 2 outages vs 10 outages.<\/span><\/span><\/span><\/p>\n<p><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">Which month do you think would most upset the users?<\/span><\/span><\/span><\/p>\n<ul>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">January User: Well, there were a couple of outages this month, but life went on.<\/span><\/span><\/span><\/li>\n<li><span style=\"font-family: Calibri;\"><span style=\"font-size: medium;\"><span style=\"color: #000000;\">February User: G@#DA&amp;M, MOTHER#$$^ING\u00a0COMPUTER DEPARTMENT\u00a0SUCKS! I WILL KILL YOU ALL! YOU SHOULD ALL BE FIRED YOU INCOMPETANT BAS%#RDS!<\/span><\/span><\/span><\/li>\n<\/ul>\n<p><span style=\"font-size: medium;\"><span style=\"color: #000000;\"><span style=\"font-family: Calibri;\">Yet for some reason we only measure and report on the aspect that is of lower importance to the users. Interesting.<\/span><\/span><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>IT prides itself on ensuring that our services are highly available. We measure availability to ridiculously high levels of precision. Indeed, we have a magical number that represents availability nirvana. We call it \u201cFive Nines\u201d. The term Five Nines indicates that our systems are available for 99.999% of the agreed time. Notice that I said &hellip;<br \/><a href=\"http:\/\/itiltopia.com\/?p=744\">Read more <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":751,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false},"categories":[13],"tags":[],"jetpack_featured_media_url":"http:\/\/itiltopia.com\/wp-content\/uploads\/2014\/05\/timexz.jpg","jetpack_publicize_connections":[],"_links":{"self":[{"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/posts\/744"}],"collection":[{"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/itiltopia.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=744"}],"version-history":[{"count":23,"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/posts\/744\/revisions"}],"predecessor-version":[{"id":1271,"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/posts\/744\/revisions\/1271"}],"wp:featuredmedia":[{"embeddable":true,"href":"http:\/\/itiltopia.com\/index.php?rest_route=\/wp\/v2\/media\/751"}],"wp:attachment":[{"href":"http:\/\/itiltopia.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=744"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/itiltopia.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=744"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/itiltopia.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=744"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}