| CPC G06F 9/5072 (2013.01) [G06F 11/3006 (2013.01); G06F 11/3055 (2013.01); G06F 2209/5015 (2013.01); G06F 2209/5016 (2013.01); G06F 2209/503 (2013.01); G06F 2209/505 (2013.01); G06F 2209/508 (2013.01)] | 20 Claims |

|
1. A computer-implemented method comprising:
at a given instance of a cluster of instances of at least one service, selecting at least one monitored instance from the cluster of instances according to a selection criterion such that each instance of the cluster of instances is selected as a monitored instance by at least one other instance of the cluster of instances, wherein the given instance, the cluster of instances, and the monitored instance are instances of a same service;
causing the given instance to detect an operational status of the at least one monitored instance; and
causing the given instance to provide, based on the operational status indicating that one of the at least one monitored instance is failed, the operational status of the failed monitored instance to a centralized controller for the cluster of instances.
|