An object or an application has two distinct aspects: functions and data.
If a function is not associated with individual data or state, high availability is easy to achieve: simply restart the function's process when the old process crashes. In reality, however, most functions are associated with individual data or state, and some, such as Entity EJBs, persist data in a database or in files. We therefore need to make the data management systems highly available to all processes, and we need to ensure data integrity, because a failed process might have damaged the data.
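The restart-on-crash approach for a function without its own data or state can be sketched as follows. This is a minimal illustration, not a production supervisor: `run_with_restart` and `make_flaky_worker` are hypothetical names, and the crash is simulated with an exception rather than a real process failure.

```python
from typing import Callable


def run_with_restart(worker: Callable[[], str], max_restarts: int = 3) -> str:
    """Run a stateless worker and start it again whenever it crashes."""
    restarts = 0
    while True:
        try:
            return worker()
        except Exception as exc:
            restarts += 1
            if restarts > max_restarts:
                raise RuntimeError("worker kept crashing") from exc
            # The worker owns no data or state, so there is nothing to
            # recover or verify: the loop simply starts it again.


def make_flaky_worker(failures: int) -> Callable[[], str]:
    """Hypothetical worker that crashes `failures` times, then succeeds.

    The counter below only drives the simulated crashes; it is not
    application state that would need to survive a restart.
    """
    remaining = {"count": failures}

    def worker() -> str:
        if remaining["count"] > 0:
            remaining["count"] -= 1
            raise ConnectionError("simulated crash")
        return "ok"

    return worker


if __name__ == "__main__":
    print(run_with_restart(make_flaky_worker(2)))
```

The sketch works only because nothing has to be restored between attempts. Once the worker holds persisted data, a blind restart is no longer enough, which is the data-integrity concern raised above.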
Failover refers to moving a single process from the primary system to the backup system in the cluster. Failure recovery includes several steps:
This failover process takes several minutes after the fault is detected. The approach can be used for both function-centric and data-centric applications, in both Active/Passive and Active/Active configurations.
Failback, or fallback, is similar to failover, but it moves the process from the backup system back to the primary system once the primary system is online again. In a mutual takeover configuration, the backup node is also running its own original application, as shown in Figure 9-8, so failing back improves the performance of both applications.
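The relationship between failover and failback in a mutual takeover configuration can be sketched with a small in-memory model. The `Node` and `Cluster` classes and the application names are assumptions made for illustration. The model only tracks which node hosts which application; it does not cover fault detection, data recovery, or any real clustering product.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class Node:
    name: str
    online: bool = True
    apps: Set[str] = field(default_factory=set)


class Cluster:
    """Two-node cluster in which each node can take over the other's apps."""

    def __init__(self, primary: Node, backup: Node) -> None:
        self.primary = primary
        self.backup = backup
        # Remember each application's original node so it can fail back.
        self.home: Dict[str, Node] = {}
        for node in (primary, backup):
            for app in node.apps:
                self.home[app] = node

    def _peer(self, node: Node) -> Node:
        return self.backup if node is self.primary else self.primary

    def failover(self, failed: Node) -> None:
        """Move every application from the failed node to its peer."""
        peer = self._peer(failed)
        failed.online = False
        peer.apps |= failed.apps
        failed.apps = set()

    def failback(self, recovered: Node) -> None:
        """Return applications to their original node once it is back."""
        peer = self._peer(recovered)
        recovered.online = True
        for app, home in self.home.items():
            if home is recovered and app in peer.apps:
                peer.apps.remove(app)
                recovered.apps.add(app)


if __name__ == "__main__":
    a = Node("A", apps={"app1"})
    b = Node("B", apps={"app2"})
    cluster = Cluster(a, b)
    cluster.failover(a)   # B now hosts both applications
    cluster.failback(a)   # app1 returns to A, B keeps only app2
```

After `failover`, the surviving node carries both workloads. After `failback`, each node hosts only its own application again, which is why failing back helps both applications in the mutual takeover case.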