US 11,748,364 B2
Scalable object stream query for objects in a distributed storage system
Huamin Chen, Westford, MA (US); and Yehuda Sadeh-Weinraub, Los Angeles, CA (US)
Assigned to Red Hat, Inc., Raleigh, NC (US)
Filed by Red Hat, Inc., Raleigh, NC (US)
Filed on May 25, 2021, as Appl. No. 17/330,275.
Prior Publication US 2022/0382754 A1, Dec. 1, 2022
Int. Cl. G06F 16/2458 (2019.01); G06F 16/28 (2019.01); G06F 16/21 (2019.01); G06F 16/242 (2019.01)
CPC G06F 16/2471 (2019.01) [G06F 16/219 (2019.01); G06F 16/2433 (2019.01); G06F 16/289 (2019.01)]
20 Claims
 
1. A method comprising:
receiving, by a distributed storage system, a request from a client to execute a query with respect to data stored at the distributed storage system, wherein the query specifies a duration parameter and an aggregation parameter, the duration parameter specifying a window of time;
executing the query to produce a result object, wherein the result object comprises data resulting from the executing the query;
storing the result object at the distributed storage system;
transmitting the result object to the client;
re-executing the query at a subsequent point in time to update the result object, wherein the result object is divided into a plurality of portions, and wherein each portion of the plurality of portions is associated with one version number of a plurality of version numbers;
responsive to updating a portion of the plurality of portions of the result object, updating a version number associated with the portion of the result object;
identifying, based on a corresponding version number, a most recently updated portion of the result object;
aggregating, according to the duration parameter and the aggregation parameter, the most recently updated portion into the updated result object; and
transmitting, to the client, the updated result object.
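
For illustration only, the steps of claim 1 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the names StreamQuery, Portion, ResultObject and AGGREGATORS are hypothetical, records are assumed to be (timestamp, key, value) tuples, each key is assumed to map to one portion of the result object, version numbers are assumed to come from a single increasing counter, and returning the result object stands in for transmitting it to the client.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical aggregation functions selectable by the aggregation parameter.
AGGREGATORS: Dict[str, Callable[[List[float]], float]] = {
    "sum": sum,
    "max": max,
    "count": len,
}

Record = Tuple[float, str, float]  # assumed shape: (timestamp, key, value)


@dataclass
class Portion:
    values: List[float] = field(default_factory=list)
    version: int = 0  # set from the counter each time this portion changes


@dataclass
class ResultObject:
    portions: Dict[str, Portion] = field(default_factory=dict)
    aggregate: Dict[str, float] = field(default_factory=dict)


class StreamQuery:
    def __init__(self, duration: float, aggregation: str) -> None:
        self.duration = duration        # duration parameter: window of time
        self.aggregation = aggregation  # aggregation parameter
        self.result = ResultObject()    # result object kept by the store
        self._clock = 0                 # source of increasing version numbers

    def execute(self, records: List[Record], now: float) -> ResultObject:
        """Execute (or re-execute) the query and return the result object."""
        start_version = self._clock

        # Collect per-key values inside the window (now - duration, now].
        windowed: Dict[str, List[float]] = {}
        for ts, key, value in records:
            if now - self.duration < ts <= now:
                windowed.setdefault(key, []).append(value)

        # Update only the portions whose contents changed, and give each
        # changed portion a new version number.
        for key in sorted(set(self.result.portions) | set(windowed)):
            values = windowed.get(key, [])
            portion = self.result.portions.setdefault(key, Portion())
            if portion.values != values:
                self._clock += 1
                portion.values = values
                portion.version = self._clock

        # Identify the portions updated during this execution by version
        # number and fold only those into the aggregate.
        aggregate_fn = AGGREGATORS[self.aggregation]
        for key, portion in self.result.portions.items():
            if portion.version > start_version:
                if portion.values:
                    self.result.aggregate[key] = aggregate_fn(portion.values)
                else:
                    self.result.aggregate.pop(key, None)

        return self.result
```

In this sketch, a second call to execute with a later value of now leaves unchanged portions and their version numbers untouched, so only the portions that changed since the previous execution are re-aggregated.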