Service Level Agreements and the Four Nines are Not Enough for High Availability in the Cloud

Reading Time: 2 minutes

When most people think of high availability, they set four nines (99.99%) or less than five minutes of downtime every month as the baseline. But according to Dave Bermingham, SIOS Director of Customer Success, in this TFiR video interview, high availability is more than that.

Dave argues that counting on nines is really a measurement that you might be judged against, but really trying to guarantee a level of nines is almost impossible. Because there’s so many points in that availability chain that can be a single point of failure. Four nines is certainly a great number to be judged against and to strive for, but overall it doesn’t mean a lot to have just four nines for my database server.

Effective High Availability Covers a Complex Availability Chain

Even with Cloud SLAs (Service Level Agreements), one can’t be fully rest assured as most cloud providers offer four nines on compute, which is only one part of the availability chain (along with network, storage, and the hops between). Bermingham warns, “There’s a million points of failure. So, trying to think that my cloud provider offers four nines so I’m covered, you’re kind of fooling yourself there. You have to look at the big picture and do what you can to identify those points of failures, to minimize the potential points of failure and to have a recovery plan, should something happen.”

When considering High Availability/Disaster Recovery (HA/DR), Bermingham believes the thing that causes the most visible downtime is human error. Bermingham also suggests that authorization and access to the system should also be restricted to reduce the point of failure. “You should only give access to those who absolutely need access to it and you should also ensure that they are highly trained and that you have all the things in place to help minimize potential oops.”

SIOS offers a single solution to meet both high availability and disaster recovery needs across a wide variety of operating systems (Windows, Linux), platforms, and applications, including SAP, SAP HANA, MaxDB, SQL Server, Oracle, and other environments running in SAN-based, shared storage configurations or SANless, local data storage configurations. Contact us for more information.


Recent Posts

SIOS Background

The Challenges of Using Amazon EBS Multi-Attach for Microsoft Failover Clusters

Overview of Amazon EBS and Microsoft Failover Clusters  Amazon EBS Multi-Attach volumes and Microsoft Failover Clusters are powerful tools in the world of […]

Read More

Build High Availability with a HANA 3-Node HSR Cluster in AWS Using SIOS LifeKeeper

Introduction: How to Ensure HA and DR in Your Database Creating a highly available SAP HANA environment in AWS is a critical task […]

Read More

How Manufacturers Can Use Failover Clustering to Avoid Downtime

In manufacturing, ERP systems and databases like SAP, SAP HANA, Oracle Database, and SQL Server manage various processes throughout their operations. One of […]

Read More