Using the Health Monitoring service, you can monitor the status of the following:
Database replication
File replication
LDAP replication
System health check
Application server
The system checks the condition of services on both the primary and secondary System Manager servers.
You can configure the following parameters from of the System Manager web console:
Health monitoring interval
The number of days the health monitoring data must be retained
The number of successive retries before an alarm is raised
You can configure the timeout interval for health monitoring in the MonitorConfig.properties file from System Manager CLI. The properties file is available in the $MGMT_HOME/SystemMonitor/res/ location. The default timeout interval is 15 seconds.
The health monitoring includes the overall status of the replication, and the detailed health metric such as the time and size of the data that the secondary System Manager server lags in replication behind the primary System Manager server.
You can view the heartbeat status and the health monitoring details in the graphical format for different services from View Heartbeat Status from on System Manager web console.