| CPC G06F 21/6245 (2013.01) [G06F 16/2237 (2019.01); G06F 16/2458 (2019.01); G06F 16/2462 (2019.01); G06F 21/602 (2013.01); G16H 50/70 (2018.01); G06F 2221/2115 (2013.01)] | 10 Claims |

|
1. A computerized method for dataset verification in a zero-trust computing environment, the method comprising:
receiving a sample dataset comprising records;
generating a sample vector for the entire sample dataset normalized by the total number or records in the sample dataset;
selecting a subset of data from the sample dataset;
generating a matrix from the subset of data;
divide the matrix into a series of vectors;
generating an example vector, wherein the example vector is generated by summing the series of vectors and normalizing the sum by the number of records in the subset of data;
calculating a difference between the sample vector-set and the example vector;
when the difference is below a threshold, applying a machine learning algorithm to the subset of data; and
when the difference is above a threshold, rejecting the subset of data.
|