US 11,934,556 B2
Identifying sensitive content in electronic files
Sekar Krishnan, Chennai (IN)
Assigned to PAYPAL, INC., San Jose, CA (US)
Filed by PayPal, Inc., San Jose, CA (US)
Filed on Sep. 29, 2021, as Appl. No. 17/489,017.
Prior Publication US 2023/0096474 A1, Mar. 30, 2023
Int. Cl. G06F 21/00 (2013.01); G06F 16/11 (2019.01); G06F 16/16 (2019.01); G06F 21/62 (2013.01)
CPC G06F 21/6245 (2013.01) [G06F 16/116 (2019.01); G06F 16/16 (2019.01)]
20 Claims
 
1. A computer system comprising:
a non-transitory memory storing instructions; and
one or more hardware processors configured to read the instructions and cause the computer system to perform operations comprising:
receiving a request to determine whether an electronic file contains sensitive content;
preprocessing the electronic file based on a type of the electronic file, wherein the preprocessing results in a first input file having a first format corresponding to a numeric category and a second input file having a second format corresponding to a textual category;
inputting the first input file to a first machine learning engine configured to receive input files having the first format;
classifying, based on an output of the first machine learning engine from the inputting the first input file, the first input file for the sensitive content;
inputting the second input file to a second machine learning engine configured to receive input files having the second format;
classifying, based on an output of the second machine learning engine from the inputting the second input file, the second input file for the sensitive content; and
generating a report based on a combination of a first output from the first machine learning engine and a second output from the second machine learning engine, wherein the report indicates items of the sensitive content that are contained in the electronic file.
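
The flow of operations recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: every name here (`preprocess`, `numeric_engine`, `textual_engine`, `scan`, `Report`) is hypothetical, only a whitespace-delimited text file type is handled, and the two "engines" are simple rule-based stand-ins (a Luhn checksum and an email regex) in place of trained machine learning models, which the claim does not specify.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Report:
    """Combined findings from both engines (hypothetical structure)."""
    items: list = field(default_factory=list)

    @property
    def contains_sensitive(self) -> bool:
        return bool(self.items)


def preprocess(content: str, file_type: str):
    """Split file content into a numeric input and a textual input.

    Assumption: only a plain-text type is supported in this sketch.
    """
    if file_type != "txt":
        raise ValueError(f"unsupported file type in this sketch: {file_type}")
    tokens = content.split()
    numeric = [t for t in tokens if t.isascii() and t.isdigit()]
    textual = [t for t in tokens if not (t.isascii() and t.isdigit())]
    return numeric, textual


def luhn_ok(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def numeric_engine(numeric):
    """Stand-in for the first engine: flags 16-digit Luhn-valid numbers."""
    return [("card_number", n) for n in numeric if len(n) == 16 and luhn_ok(n)]


EMAIL = re.compile(r"^[\w.+-]+@[\w-]+\.[\w.-]+$")


def textual_engine(textual):
    """Stand-in for the second engine: flags email-like tokens."""
    return [("email", t) for t in textual if EMAIL.match(t)]


def scan(content: str, file_type: str = "txt") -> Report:
    """Preprocess, run each engine on its own input, and combine the outputs."""
    numeric, textual = preprocess(content, file_type)
    first_output = numeric_engine(numeric)
    second_output = textual_engine(textual)
    return Report(items=first_output + second_output)


if __name__ == "__main__":
    report = scan("order 4111111111111111 sent to alice@example.com ref 1234")
    for kind, value in report.items:
        print(kind, value)
```

Keeping the two engines behind separate functions mirrors the claim's separation of a numeric-category input from a textual-category input; in a real system each function would wrap a trained model that accepts only its own input format.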