US 11,675,851 B2
Streaming method for the creation of multifaceted statistical distributions
Dustin Garvey, Exeter, NH (US); Brent Arthur Enck, Roseville, CA (US); Sampanna Shahaji Salunke, Dublin, CA (US); Uri Shaft, Fremont, CA (US); John Branson Bley, Chapel Hill, NC (US); Timothy Mark Frazier, Livermore, CA (US); and Sumathi Gopalakrishnan, Fremont, CA (US)
Assigned to Oracle International Corporation, Redwood Shores, CA (US)
Filed by Oracle International Corporation, Redwood Shores, CA (US)
Filed on Sep. 20, 2021, as Appl. No. 17/479,546.
Application 17/479,546 is a continuation of application No. 16/383,426, filed on Apr. 12, 2019, granted, now 11,126,667.
Claims priority of provisional application 62/748,297, filed on Oct. 19, 2018.
Prior Publication US 2022/0004579 A1, Jan. 6, 2022
Int. Cl. G06F 16/906 (2019.01); G06F 16/901 (2019.01); G06F 16/9038 (2019.01)
CPC G06F 16/906 (2019.01) [G06F 16/9024 (2019.01); G06F 16/9038 (2019.01)] 21 Claims
OG exemplary drawing
 
1. A non-transitory computer readable medium storing instructions which, when executed by one or more hardware processors, cause performance of operations comprising:
identifying, from a data stream, a first plurality of events that occur during a first time interval on a set of nodes in a computing environment;
ordering the events, in the first plurality of events, in an event distribution;
partitioning the first plurality of events into a plurality of event clusters based on respective characteristics of each event in the first plurality of events;
wherein a first event descriptor is associated with one or more events in the first event cluster of the plurality of clusters and a second event descriptor is associated with one or more events in a second event cluster of the plurality of event clusters;
responsive at least to determining that events in the first event cluster and events in the second event cluster are less than a particular distance from a center of the event data distribution:
classifying the first event cluster and the second event cluster as candidate event clusters for descriptor consolidation;
merging the first event cluster and the second event cluster into a third event cluster, wherein a third event descriptor is associated with one or more events in the third event cluster;
responsive to receiving user input requesting to view a data set comprising the third event cluster, displaying information about the third event descriptor that is associated with the one or more events in the third event cluster;
identifying a fourth event cluster and a fifth event cluster of the plurality of event clusters;
responsive at least to determining that events in the fourth event cluster and events in the fifth event cluster are more than the particular distance from the center of the event data distribution, refraining from classifying the fourth event cluster and the fifth event cluster as candidate event clusters for descriptor consolidation.