CPC G06F 16/24568 (2019.01) [G06F 16/2456 (2019.01); G06Q 50/04 (2013.01)] | 16 Claims |
1. A method for execution by a processing system of a node, comprising:
determining, by at least one processor of the processing system, at least one query for execution;
in response to the determining the at least one query:
determining, by the at least one processor of the processing system, a set of segments that include a set of rows required to execute the at least one query;
determining, by the at least one processor of the processing system, at least one virtual segment in the set of segments, wherein a first segment of the at least one virtual segment includes a plurality of sequential data portions, and wherein the first segment is included in a segment group generated by utilizing a redundancy coding scheme;
identifying, by the at least one processor of the processing system, at least two data portions in the at least one virtual segment, wherein the at least two data portions are a proper subset of the plurality of sequential data portions;
generating, by the at least one processor of the processing system, a locally rebuilt set of rows for the at least two data portions by utilizing a recovery scheme, wherein each sequential data portion of the plurality of sequential data portions is independently recoverable via the recovery scheme, and wherein the generating the locally rebuilt set of rows includes:
determining a plurality of segments in the segment group required to rebuild the first segment, wherein each segment of the plurality of segments includes a plurality of corresponding data portions;
identifying, for each segment of the plurality of segments, the at least two data portions of the corresponding data portions of the each segment of the plurality of segments required to rebuild the at least two data portions of the first segment;
retrieving the at least two data portions of the corresponding data portions of the each segment of the plurality of segments from a corresponding one of a set of memory drives, wherein the retrieving the at least two data portions of the corresponding data portions of the each segment of the plurality of segments includes generating a set of read requests to each corresponding one of the set of memory drives by batching a plurality of read requests to the each corresponding one of the set of memory drives, and wherein each read request of the plurality of read requests corresponds to one data portion of the at least two data portions of the corresponding each segment of the plurality of segments; and
utilizing the retrieved at least two data portions of the corresponding data portions of the each segment of the plurality of segments to rebuild the at least two data portions based on the redundancy coding scheme; and
generating, by the at least one processor of the processing system, query resultant data based on performing execution of the at least one query by utilizing the locally rebuilt set of rows.
|