Monitoring runtime cluster and cloud status
Runtime cluster and runtime cloud owners should periodically monitor the status of their nodes to ensure that they are online.
About this task
Although node information is available in the Cluster Status panel in the Runtime Management page (Manage > Runtime Management), to ensure you receive prompt notification of problems, Boomi recommends that you write or schedule a batch script, or configure a separate monitoring tool, to do the following:
Procedure
-
Maintain a list of the IP addresses of the expected nodes. If an IP address changes, update the list of expected IP addresses.
-
For each IP address:
-
Ensure that an
<installation_directory>\bin\views\node.localhostid.datfile exists for each expected node IP address. There are other files in this directory, so the specific file name should be used. Thelocalhostidin the file name has non-ASCII characters replaced with underscores (_). For example, the IP address127.0.0.1appears asnode.127_0_0_1.dat.If a node’s
.datfile is missing, the node is not running. Configure your script or monitoring tool to raise an alert. -
Verify that the date modified timestamp for each
.datfile has been updated in the last 10 minutes.If the date modified timestamp is more than 10 minutes old, the node is unresponsive. Configure your script or monitoring tool to raise an alert.
-
Read each file and look for the presence of a property called
problem.If a
problemproperty exists in the.datfile, the node has detected a problem with the cluster. Configure your script or monitoring tool to raise an alert. See the Cluster Problems topic for more information.
If an
<installation_directory>\bin\views\node.localhostid.datfile is found for each expected node, and the file is updated regularly, and the file does not contain any problem properties, the cluster is running as expected. -