About the Health Monitoring service

Last Updated : May 28, 2014 |

Using the Health Monitoring service, you can monitor the status of the following:

  • Database replication

  • File replication

  • LDAP replication

  • System health check

  • Application server

The system checks the condition of services on both the primary and secondary System Manager servers.

You can configure the following parameters from Services > Configurations > Settings > SMGR > HealthMonitor of the System Manager web console:

  • Health monitoring interval

  • The number of days the health monitoring data must be retained

  • The number of successive retries before an alarm is raised

You can configure the timeout interval for health monitoring in the MonitorConfig.properties file from System Manager CLI. The properties file is available in the $MGMT_HOME/SystemMonitor/res/ location. The default timeout interval is 15 seconds.

The health monitoring includes the overall status of the replication, and the detailed health metric such as the time and size of the data that the secondary System Manager server lags in replication behind the primary System Manager server.

You can view the heartbeat status and the health monitoring details in the graphical format for different services from View Heartbeat Status from Services > Geographic Redundancy > GR Health on System Manager web console.