CPC G06F 21/6245 (2013.01) [G06F 16/116 (2019.01); G06F 16/16 (2019.01)] | 20 Claims |
1. A computer system comprising:
a non-transitory memory storing instructions; and
one or more hardware processors configured to read the instructions and cause the computer system to perform operations comprising:
receiving a request to determine whether an electronic file contains sensitive content;
preprocessing the electronic file based on a type of the electronic file, wherein the preprocessing results in a first input file having a first format corresponding to a numeric category and a second input file having a second format corresponding to a textual category;
inputting the first input file to a first machine learning engine configured to receive input files having the first format;
classifying, based on an output of the first machine learning engine from the inputting the first input file, the first input file for the sensitive content;
inputting the second input file to a second machine learning engine configured to receive input files having the second format;
classifying, based on an output of the second machine learning engine from the inputting the second input file, the second input file for the sensitive content; and
generating a report based on a combination of a first output from the first machine learning engine and a second output from the second machine learning engine, wherein the report indicates items of the sensitive content that are contained in the electronic file.
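The operations recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the function names (`preprocess`, `numeric_engine`, `textual_engine`, `classify`, `generate_report`), the 0.5 threshold, and the choice of SSN-like numbers, card-like numbers, and email addresses as "sensitive content" are all assumptions made for illustration. The two "engines" are simple rule-based scorers standing in for trained machine learning models, which the claim does not specify.

```python
import csv
import io
import re

# Hypothetical patterns and threshold; none of these come from the claim.
NUMERIC_TOKEN = re.compile(r"\d(?:[\d-]*\d)?")   # digit runs, hyphens allowed inside
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
THRESHOLD = 0.5


def preprocess(filename, content):
    """Type-dependent preprocessing into a numeric input and a textual input."""
    if filename.lower().endswith(".csv"):
        rows = csv.reader(io.StringIO(content))
        flat = " ".join(cell for row in rows for cell in row)
    else:
        flat = content  # treat every other type as plain text
    numeric_input = [m.group().replace("-", "") for m in NUMERIC_TOKEN.finditer(flat)]
    textual_input = NUMERIC_TOKEN.sub(" ", flat)  # text with numeric tokens removed
    return numeric_input, textual_input


def luhn_valid(digits):
    """Luhn checksum, used here as a cheap signal for card-like numbers."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:
            d = d * 2 - 9 if d > 4 else d * 2
        total += d
    return total % 10 == 0


def numeric_engine(numbers):
    """Stand-in for the first engine: returns (kind, value, score) per number."""
    out = []
    for n in numbers:
        if len(n) == 9:
            out.append(("ssn_like", n, 0.9))
        elif 13 <= len(n) <= 19 and luhn_valid(n):
            out.append(("card_like", n, 0.9))
        else:
            out.append(("number", n, 0.1))
    return out


def textual_engine(text):
    """Stand-in for the second engine: returns (kind, value, score) per match."""
    return [("email", m.group(), 0.9) for m in EMAIL.finditer(text)]


def classify(scored):
    """Keep only the items whose score reaches the threshold."""
    return [(kind, value) for kind, value, score in scored if score >= THRESHOLD]


def generate_report(filename, content):
    """Combine both engines' classified outputs into a single report."""
    numeric_input, textual_input = preprocess(filename, content)
    numeric_hits = classify(numeric_engine(numeric_input))
    textual_hits = classify(textual_engine(textual_input))
    items = [{"kind": k, "value": v, "engine": "numeric"} for k, v in numeric_hits]
    items += [{"kind": k, "value": v, "engine": "textual"} for k, v in textual_hits]
    return {"file": filename, "contains_sensitive": bool(items), "items": items}
```

Calling `generate_report("people.csv", "name,ssn,email\nAda,123-45-6789,ada@example.com\n")` in this sketch yields a report listing one item from the numeric path and one from the textual path. Keeping the two paths separate until the report step mirrors the claim's structure, in which each engine only receives input in the format it is configured for.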