Nodes & Clusters: Key Components of High Availability

Reading Time: 4 minutes

I wanted to spend some time reviewing the terms “nodes” and “clusters.” For the purposes of this blog, I will explain how SIOS uses these terms and others and what they specifically mean.

These might be considered standard terms in the world of distributed computing, but if you are new to the field, you may wonder exactly what they mean.

What Are Nodes in Distributed Computing?

When I started with SIOS, I noticed that the terms node and cluster were common, everyday words that you would hear many times daily. I kept asking myself, ‘Why are they using the word “node”’? It sounds from the context that they mean server, but why do they say node? To explain, a node can be a server, but it can also be a client computer or a peer; it is essentially any component used to perform computing duties and route traffic.

In Amazon Web Services (AWS), a node can be a virtual machine implemented as an EC2 instance. You can install and run software on it, and it can have a network interface that can be used to communicate with it and for it to connect to other nodes. When you SSH into an AWS EC2 instance, the client computer from which you are launching your SSH session is an example of a client node, and you are connecting to an EC2 server instance node. Nodes can be a physical machine on-premises or a virtual machine (VM).

Understanding Clusters: How Nodes Work Together

Let’s move on to the term “cluster”. This word might make one think of things that are stuck together. In the distributed computing world, this means nodes that are linked together to form a combined resource that might handle a bigger task than a single node can handle. At SIOS, we have special cluster protection software on each node that monitors the volumes and can launch failover operations when problems are detected or respond to resources being intentionally taken in and out of service by a user.

You might link nodes together in a cluster to perform automatic backups. You could run a database server on a separate node to isolate the computing power/disk I/O and the data from other operations.

The Role of Redundancy in High-Availability Clusters

Clusters can also provide redundancy to allow services to remain up when one node fails. Redundancy of operation is not a new concept. The days of running any vital operation on a single server that has no redundancy are hopefully well behind us.

For example, in the blade-computing world, redundancy is facilitated in a blade server configuration by running two computing modules within the same unit. The server firmware handles the failover/switchover logic. Power supplies, and rack KVM, are shared amongst the load of server hardware for cost savings.

Facility operators may add more hardware to a server in an incremental fashion to handle extra load. This allows an operator to right-size their system and purchase / build it using standardized components from the rack manufacturer. This provides a more limited but similar scaling mechanism to that in the cloud world, the difference being that it is all hosted in one box. On-premises rack hardware such as this or similar can be used to construct clustered nodes.

Cloud-Based Clusters vs. On-Premises Clusters

Cloud clusters benefit from all of the attributes of redundancy built into rack server equipment, as they are basically discrete VMs that run on shared data-center hardware owned by the cloud provider. However, they permit the customer to spread their clusters over different locations, intentionally load-slicing their computer needs into VM’s running in different physical buildings in other areas of the cloud provider’s physical data centers.

This provides an enormous resiliency to single-site outages. A cluster implemented in the cloud utilizing servers in various locations can tolerate complete power loss to one location.

Nodes and Clusters Explained

Some questions that come up:

Q. Is a cluster the same as a node?

A. No, a node is typically one component that can perform computer duties. A cluster consists of 2 or more nodes.

Q. What is a 3-node cluster?

A. A 3-node cluster is a cluster of 3 nodes with communication paths between each of the respective nodes. 3 nodes, being an odd-numbered configuration, typically one of the nodes will be a so-called ‘witness’ node and may not perform other work. In the event of a partially failed network, and a node being unable to communicate with its peer, the two main server nodes may not be able to determine who should take control (this phenomenon is called ‘split-brain’). A witness node can offer information on what nodes it can see are in service, providing data to resolve the split-brain to bring up one active node and put the other node into standby mode, regaining correct control of the nodes.

Q. What is 2 node cluster?

A. A 2 node cluster is a cluster of 2 nodes with one or more communication paths between them. This is typically used to run services on a primary node and have the second node on standby.

Q. How many nodes make a cluster?

A. 2 or more nodes make a cluster.

Maximizing High Availability with Nodes and Clusters

In summary, clusters are formed from nodes; a node is an independent computing module with networking capabilities. Be aware of the benefits of putting your nodes in different physical locations to guard against downtime in one area.

Contact SIOS today to learn how our clustering solutions can help you optimize high availability and minimize downtime.

Author: Paul Scrutton
Principal Software Engineer at SIOS

What We Do

The SIOS Advantage

Products & Services

Learn about automated SAP HANA Multitarget DR

Not Sure What You Need?

Solutions

Blog

Blog Categories

Recent Posts

Resources

Resource Library

Company

SIOS in the news

Nodes and Clusters: The Building Blocks of High Availability