US 11,983,170 B2
System and method for transaction continuity across failures in a scale-out database
Ajit Mylavarapu, Mountain View, CA (US); Vasudha Krishnaswamy, Fremont, CA (US); Sukhada Pendse, Foster City, CA (US); Solmaz Kolahi, Redwood City, CA (US); Ankita Kumar, Bellevue, WA (US); Garret F. Swart, Palo Alto, CA (US); Tirthankar Lahiri, Palo Alto, CA (US); and Juan R. Loaiza, Woodside, CA (US)
Assigned to Oracle International Corporation, Redwood Shores, CA (US)
Filed by Oracle International Corporation, Redwood Shores, CA (US)
Filed on Mar. 6, 2023, as Appl. No. 18/117,810.
Application 18/117,810 is a continuation of application No. 17/137,745, filed on Dec. 30, 2020, granted, now 11,599,421.
Application 17/137,745 is a continuation in part of application No. 17/070,277, filed on Oct. 14, 2020, granted, now 11,550,771, issued on Jan. 10, 2023.
Prior Publication US 2023/0281190 A1, Sep. 7, 2023
Int. Cl. G06F 16/23 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/2379 (2019.01) [G06F 16/285 (2019.01)]
20 Claims
 
1. A method comprising:
on each host of a plurality of hosts, executing one or more engine instances of a plurality of engine instances to form an engine cluster, wherein each host of the plurality of hosts is a computing device;
establishing neighbor relationships among the plurality of hosts;
executing a plurality of control instances on a first set of hosts of the plurality of hosts to form a control cluster, wherein:
each control instance executes on a distinct host of the first set of hosts,
the plurality of control instances maintain data indicating the neighbor relationships between the plurality of hosts, and
the plurality of control instances comprise a control instance leader and one or more control instance followers;
in response to a first host within the plurality of hosts indicating a failure of a neighbor host, initiating a pair-wise focused investigation to check peer-to-peer connections between the first host and the neighbor host;
in response to one or more additional hosts within the plurality of hosts indicating failures of neighbor hosts while the pair-wise focused investigation is being performed, performing a wide investigation to check connections between the control cluster and the plurality of hosts;
adding one or more hosts to an eviction list based on results of the pair-wise focused investigation or the wide investigation; and
performing an eviction protocol to evict the one or more hosts from the engine cluster using the eviction list.
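
The escalation recited in claim 1 (a pair-wise focused investigation that widens when further failure reports arrive, followed by eviction from a list) can be illustrated with a minimal Python sketch. This is not the patented implementation: the class `ControlLeader`, its fields, the probe-result dictionaries `p2p_ok` and `control_ok`, and the choice to list the reported neighbor when the pair-wise check fails are all hypothetical assumptions made for illustration. Real probes would be network checks; here they are stubbed as lookups.

```python
from dataclasses import dataclass, field


@dataclass
class ControlLeader:
    """Hypothetical control instance leader (names are illustrative only)."""
    neighbors: dict        # host -> set of neighbor hosts it monitors
    p2p_ok: dict           # frozenset({a, b}) -> stubbed peer-to-peer probe result
    control_ok: dict       # host -> stubbed control-cluster-to-host probe result
    members: set = field(default_factory=set)
    eviction_list: list = field(default_factory=list)

    def focused_investigation(self, reporter, suspect):
        """Check only the peer-to-peer connection between the two hosts."""
        return self.p2p_ok.get(frozenset((reporter, suspect)), False)

    def wide_investigation(self):
        """Check the connection from the control cluster to every member host."""
        return sorted(h for h in self.members if not self.control_ok.get(h, False))

    def handle_reports(self, first, additional=()):
        """Process a failure report; `additional` holds reports that arrived
        while the focused investigation was still in progress."""
        reporter, suspect = first
        if suspect not in self.neighbors.get(reporter, set()):
            return []  # assumption: reports about non-neighbors are ignored
        if additional:
            # More hosts reported failures: escalate to the wide investigation.
            failed = self.wide_investigation()
        elif self.focused_investigation(reporter, suspect):
            failed = []  # link is healthy, nothing to evict
        else:
            # Assumption: the reported neighbor is the host placed on the list.
            failed = [suspect]
        for host in failed:
            if host not in self.eviction_list:
                self.eviction_list.append(host)
        return failed

    def evict(self):
        """Stand-in for the eviction protocol: drop listed hosts from the cluster."""
        evicted = list(self.eviction_list)
        self.members -= set(evicted)
        self.eviction_list.clear()
        return evicted
```

In this sketch a single report costs one pair-wise probe, while concurrent reports trigger a probe of every member host. That ordering mirrors the claim, which performs the wide investigation only when additional hosts indicate failures during the focused one.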