| CPC G06F 9/4416 (2013.01) [G06F 16/24552 (2019.01); H04L 67/10 (2013.01)] | 20 Claims |

|
1. A method, comprising:
executing, by a distributed computing system providing a data processing service, a first computing cluster comprising a first set of computing nodes;
receiving, by a computing node in the first set of computing nodes comprising the first computing cluster, a query for execution;
determining, by the computing node in the first set of computing nodes comprising the first computing cluster, that one or more data segments for executing the query are present in a cache associated with the computing node;
executing, by the computing node in the first set of computing nodes comprising the first computing cluster, the query using the one or more data segments to obtain one or more updated data segments;
writing, by the computing node in the first set of computing nodes comprising the first computing cluster, the one or more updated data segments to a nearline storage system associated with the distributed computing system;
receiving, by the distributed computing system, a request to create a second computing cluster comprising a second set of computing nodes in the distributed computing system; and
responsive to the request, bootstrapping, by the distributed computing system, the second computing cluster using the one or more updated data segments stored in the nearline storage system.
|
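The steps recited in claim 1 can be illustrated with a minimal in-memory sketch. It is not the patented implementation. The names `NearlineStore`, `Node`, `run_query`, and `bootstrap_cluster` are hypothetical, the query is modeled as a simple per-value transform, and "bootstrapping" is assumed here to mean seeding each new node's cache from the nearline store; the claim itself does not specify any of these details.

```python
from dataclasses import dataclass, field


@dataclass
class NearlineStore:
    """Hypothetical stand-in for the nearline storage system."""
    segments: dict = field(default_factory=dict)

    def write(self, updated):
        # Persist the updated data segments, keyed by segment id.
        self.segments.update(updated)


@dataclass
class Node:
    """Hypothetical computing node with a local segment cache."""
    cache: dict = field(default_factory=dict)

    def has_segments(self, segment_ids):
        # Determine whether every segment the query needs is in the cache.
        return all(sid in self.cache for sid in segment_ids)

    def run_query(self, segment_ids, transform, store):
        # The claim only covers the cache-hit path; a miss is out of scope here.
        if not self.has_segments(segment_ids):
            raise LookupError("required segments are not cached on this node")
        # Execute the query (assumed: a per-value transform) to obtain
        # the updated data segments.
        updated = {
            sid: [transform(value) for value in self.cache[sid]]
            for sid in segment_ids
        }
        # Write the updated segments to nearline storage.
        store.write(updated)
        return updated


def bootstrap_cluster(store, node_count):
    # Assumption: bootstrapping copies the stored segments into each
    # new node's cache so the second cluster starts warm.
    return [
        Node(cache={sid: list(values) for sid, values in store.segments.items()})
        for _ in range(node_count)
    ]


# First cluster: one node holds the segment the query needs.
store = NearlineStore()
first_cluster = [Node(cache={"seg-1": [1, 2, 3]}), Node()]
updated = first_cluster[0].run_query(["seg-1"], lambda v: v * 10, store)

# Request to create a second cluster, served from nearline storage.
second_cluster = bootstrap_cluster(store, node_count=2)
```

In this sketch each node of the second cluster receives its own copy of the stored segments, so later changes on one node do not affect another. A real system would more likely load segments lazily or partition them across nodes.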