| CPC G06N 5/04 (2013.01) [G06N 20/00 (2019.01)] | 30 Claims |

|
1. A non-transitory computer-readable medium having computer-readable instructions stored thereon that when executed by a processor cause the processor to:
receive input data comprising input streaming data from one or more network devices as part of a data analytics project for analyzing the input data based on at least one of a cause or effect relationship that each variable of a plurality of variables of the input data has with one or more other variables of the plurality of variables, wherein the input data comprises a plurality of observation vectors, each of the plurality of observation vectors comprising variable values of the plurality of variables, wherein a number of the plurality of variables in each of the plurality of observation vectors is n, and wherein each of the plurality of variables is associated with a unique variable index;
determine the at least one of the cause or effect relationship that each variable of the plurality of variables has with one or more other variables of the plurality of variables by generating, using machine learning, a topological order of a directed acyclic graph (DAG) by:
(A) creating a plurality of residual series vectors, each of the plurality of residual series vectors associated with one variable of the plurality of variables;
(B) calculating a normality statistic value for each of the plurality of residual series vectors to obtain a plurality of normality statistic values;
(C) calculating a mean squared error value for each of the plurality of residual series vectors;
(D) comparing each of the plurality of normality statistic values with a predefined critical value;
(E) for each value of the plurality of normality statistic values that is less than or equal to the predefined critical value, adding (a) the variable index of the variable of the plurality of variables associated with the value to an empty temporary order list; and (b) the mean squared error value of the variable of the plurality of variables associated with the value to an empty mean squared error list;
(F) counting a number of elements in the temporary order list;
(G) responsive to determining that the number of elements in the temporary order list is equal to zero, updating an order list based on the plurality of normality statistic values or responsive to determining that the number of elements in the temporary order list is not equal to zero, updating the order list based on at least one of the temporary order list or the mean squared error list;
(H) repeating (A) through (H) a plurality of times; and
(I) outputting the order list from (G) as the topological order of the DAG, the topological order of the DAG being used to transform the input streaming data into one or more meaningful output data streams for analyzing the input data by one or more users as part of the data analytics project, wherein the topological order of the DAG is indicative of the at least one of the cause or effect relationship that each variable of the plurality of variables has with the one or more other variables of the plurality of variables; and
analyze the input data based on the one or more meaningful output data streams.
|