CPC G06F 16/2379 (2019.01) [G06F 16/285 (2019.01)] | 20 Claims |
1. A method comprising:
on each host of a plurality of hosts, executing one or more engine instances of a plurality of engine instances to form an engine cluster, wherein each host of the plurality of hosts is a computing device;
establishing neighbor relationships among the plurality of hosts;
executing a plurality of control instances on a first set of hosts of the plurality of hosts to form a control cluster, wherein:
each control instance executes on a distinct host of the first set of hosts,
the plurality of control instances maintain data indicating the neighbor relationships between the plurality of hosts, and
the plurality of control instances comprise a control instance leader and one or more control instance followers;
in response to a first host within the plurality of hosts indicating a failure of a neighbor host, initiating a pair-wise focused investigation to check peer-to-peer connections between the first host and the neighbor host;
in response to one or more additional hosts within the plurality of hosts indicating failures of neighbor hosts while the pair-wise focused investigation is being performed, performing a wide investigation to check connections between the control cluster and the plurality of hosts;
adding one or more hosts to an eviction list based on results of the pair-wise focused investigation or the wide investigation; and
performing an eviction protocol to evict the one or more hosts from the engine cluster using the eviction list.
|