CPC G06F 21/64 (2013.01) | 20 Claims |
1. A system for generating security rules based on timeseries anomalies, the system comprising:
one or more processors; and
a non-transitory computer-readable storage medium storing instructions, which when executed by the one or more processors cause the one or more processors to perform operations comprising:
receiving a dataset comprising a plurality of features and a plurality of entries;
retrieving a hierarchy for segmenting the dataset, wherein the hierarchy comprises a plurality of levels and wherein each level of the hierarchy is associated with a corresponding number of features;
generating a first plurality of timeseries dataset segments based on a first level of the hierarchy and an aggregation time interval, wherein the first level of the hierarchy corresponds to a first number of features, and wherein each timeseries dataset segment of the first plurality of timeseries dataset segments is generated based on values within one or more features of the dataset;
inputting each timeseries dataset segment of the first plurality of timeseries dataset segments into an anomaly detection machine learning model to obtain a first number of anomalies for the first plurality of timeseries dataset segments;
determining that the first number of anomalies does not meet a threshold;
based on determining that the first number of anomalies does not meet the threshold, generating a second plurality of timeseries dataset segments based on a second level of the hierarchy and the aggregation time interval, wherein the second level of the hierarchy corresponds to a second number of features larger than the first number of features;
inputting each timeseries dataset segment of the second plurality of timeseries dataset segments into the anomaly detection machine learning model to obtain a second number of anomalies for the second plurality of timeseries dataset segments;
determining that the second number of anomalies meets the threshold; and
generating a corresponding security rule based on each anomaly within the second plurality of timeseries dataset segments.
|