
Set the default failure detection protocol heartbeat periods


Overview

The default failure detection protocol relies on TCP/IP socket closing events to...

If analysis indicates that excessive CPU time is spent monitoring heartbeats, we can change the heartbeat timeout period, which, in general, should equal...

Heartbeat transmission period x Number of missed consecutive heartbeats

...where...

heartbeat transmission period
    Frequency at which a core group member sends a heartbeat packet over every established connection. Default is 30 seconds.

heartbeat timeout period
    Failure detection time. If no packets are received during the specified time period, a failure is declared. Default is 180 seconds.
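As a worked example of the relationship above, the following Python sketch computes a timeout period from a transmission period and a missed-heartbeat count. The function name is hypothetical, and the count of 6 is not stated on this page; it is simply what the documented defaults (30 seconds and 180 seconds) imply.

```python
def heartbeat_timeout_ms(transmission_period_ms: int, missed_heartbeats: int) -> int:
    """Timeout period = transmission period x consecutive missed heartbeats."""
    if transmission_period_ms <= 0 or missed_heartbeats <= 0:
        raise ValueError("both arguments must be positive")
    return transmission_period_ms * missed_heartbeats


# Documented defaults: 30 s period and 180 s timeout, which implies
# 6 consecutive missed heartbeats (an inferred value, not a quoted one).
DEFAULT_PERIOD_MS = 30_000
DEFAULT_TIMEOUT_MS = heartbeat_timeout_ms(DEFAULT_PERIOD_MS, 6)
```

Lengthening the transmission period while holding the missed-heartbeat count fixed reduces heartbeat traffic but lengthens the failure detection time by the same factor.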

For mixed-cell environments, use custom properties to configure...

 

Set heartbeat periods

  1. Go to...

    Servers | Core Groups | Core group settings | core_group_name | Additional Properties | Discovery and failure detection

    ...and select the option...

    Use the default protocol providers

    If this option is not selected, do not perform any more of the steps in this task.

  2. Specify a value for...

    Heartbeat transmission period

    Default is 30000 milliseconds (30 seconds)

  3. Specify a value for...

    Heartbeat timeout period

    Default is 180000 milliseconds (180 seconds)

  4. Click OK and then click Review.

  5. Select Synchronize changes with nodes, and then click Save.

  6. Restart all of the members of the core group.

After the servers restart, the core group members all run with the new failure detection protocol settings.


Mixed cell environments

If running in a mixed cell environment, with core groups containing a mixture of V7.0 and V6.x processes, continue to use the core group custom properties...

  1. In the admin console, click...

    Servers | Core Groups | Core group settings | core_group_name | Additional Properties | Custom properties | New

  2. In the Name field, specify either...

    • IBM_CS_FD_PERIOD_SECS

      How frequently the failure detection protocol checks the core group network connections that the discovery protocol establishes.

    • IBM_CS_FD_CONSECUTIVE_MISSED

      Number of consecutive heartbeats that a member can miss before communication with that member is discontinued.
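When a mixed cell needs the same effective timing as a transmission period and timeout period expressed in milliseconds, the two custom property values can be derived as in the Python sketch below. It assumes the two properties combine by the same rule given in the overview (timeout = period x missed heartbeats) and that IBM_CS_FD_PERIOD_SECS is expressed in whole seconds, as its name suggests. The helper name is hypothetical and is not part of any product API.

```python
def custom_properties_for(period_ms: int, timeout_ms: int) -> dict:
    """Derive core group custom property values from millisecond settings.

    Assumption: timeout = period x consecutive missed heartbeats.
    """
    if period_ms <= 0 or timeout_ms <= 0:
        raise ValueError("periods must be positive")
    if period_ms % 1000 != 0:
        raise ValueError("transmission period must be a whole number of seconds")
    if timeout_ms % period_ms != 0:
        raise ValueError("timeout must be a whole multiple of the transmission period")
    return {
        "IBM_CS_FD_PERIOD_SECS": str(period_ms // 1000),
        "IBM_CS_FD_CONSECUTIVE_MISSED": str(timeout_ms // period_ms),
    }


# Values corresponding to the documented defaults (30000 ms and 180000 ms).
default_properties = custom_properties_for(30_000, 180_000)
```

Each key and value in the returned dictionary corresponds to one custom property entry created on the core group's Custom properties page.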

 

Related

Core groups (high availability domains)
Core group discovery and failure detection protocols
Set up a high availability environment