| CPC G06F 11/1497 (2013.01) [G06F 9/4856 (2013.01); G06F 9/546 (2013.01); G06F 11/1438 (2013.01); G06F 11/1448 (2013.01); G06F 11/1482 (2013.01); G06F 11/203 (2013.01); G06F 11/2041 (2013.01); G06F 11/2097 (2013.01); G06F 11/2038 (2013.01); G06F 2201/805 (2013.01); G06F 2201/82 (2013.01); G06F 2209/542 (2013.01)] | 20 Claims |

|
1. A system, comprising:
a primary host executing a primary application and running a host operating system;
one or more processors associated with the primary host, the one or more processors configured to:
intercept calls from threads of applications to the host operating system;
generate replication messages based on the intercepted calls; and
checkpoint the primary application;
one or more backup hosts executing one or more backup applications, wherein each backup host of the one or more backup hosts has a corresponding host operating system; and
one or more processors associated with the one or more backup hosts and configured to:
detect one or more missing replication messages;
transmit, to the primary host, one or more requests for retransmission of the one or more missing replication messages; and
checkpoint-restore the one or more backup applications,
wherein live migration of the primary application to the one or more backup hosts is performed in response to an event or fault.
|
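The replication path recited in claim 1 (intercepted calls become replication messages; a backup detects missing messages and requests retransmission) can be illustrated with a minimal sketch. The claim does not say how missing messages are detected; the sketch assumes monotonically increasing sequence numbers, which is one common approach. All names here (`ReplicationMessage`, `Primary`, `Backup`, `intercept`, `retransmit`, `receive`) are hypothetical and are not taken from the patent. Checkpointing, checkpoint-restore, and live migration are not modeled.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class ReplicationMessage:
    seq: int        # assumed sequence number; not specified by the claim
    thread_id: int  # thread whose call was intercepted
    call: str       # name of the intercepted operating system call
    result: Any     # value the call returned on the primary


class Primary:
    """Hypothetical primary side: turns intercepted calls into messages."""

    def __init__(self) -> None:
        self._next_seq = 0
        self._sent: Dict[int, ReplicationMessage] = {}

    def intercept(self, thread_id: int, call: str, result: Any) -> ReplicationMessage:
        # Record the call as a replication message and keep a copy so it
        # can be resent if a backup reports it missing.
        msg = ReplicationMessage(self._next_seq, thread_id, call, result)
        self._sent[msg.seq] = msg
        self._next_seq += 1
        return msg

    def retransmit(self, seqs: List[int]) -> List[ReplicationMessage]:
        # Answer a retransmission request from a backup.
        return [self._sent[s] for s in seqs]


class Backup:
    """Hypothetical backup side: applies messages in order, reports gaps."""

    def __init__(self) -> None:
        self._expected = 0
        self._pending: Dict[int, ReplicationMessage] = {}
        self.applied: List[ReplicationMessage] = []

    def receive(self, msg: ReplicationMessage) -> List[int]:
        # Buffer the message unless it was already applied.
        if msg.seq >= self._expected:
            self._pending[msg.seq] = msg
        # Apply the longest in-order run that is now available.
        while self._expected in self._pending:
            self.applied.append(self._pending.pop(self._expected))
            self._expected += 1
        if not self._pending:
            return []
        # Anything between the next expected number and the highest
        # buffered number that is not buffered is considered missing.
        highest = max(self._pending)
        return [s for s in range(self._expected, highest) if s not in self._pending]


def demo():
    primary, backup = Primary(), Backup()
    msgs = [
        primary.intercept(1, "read", 10),
        primary.intercept(2, "write", 4),
        primary.intercept(1, "close", 0),
    ]
    backup.receive(msgs[0])
    missing = backup.receive(msgs[2])  # msgs[1] is lost in transit
    for m in primary.retransmit(missing):
        backup.receive(m)
    return missing, [m.seq for m in backup.applied]
```

Calling `demo()` simulates the loss of the second message: the backup reports the gap, the primary resends the stored copy, and the backup then applies all three messages in their original order. Keeping sent messages on the primary until they are known to be delivered is a design assumption of this sketch; a real system would also need to bound that buffer.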