US 11,853,453 B1
Automatic identification of clear text secrets
Ariel Simhon, Hod Hasharon (IL); Liron Hayman, Hod Hasharon (IL); Gabriel Goldman, Hod Hasharon (IL); and Yaron Moshe, Hod Hasharon (IL)
Assigned to INTUIT INC., Mountain View, CA (US)
Filed by Intuit Inc., Mountain View, CA (US)
Filed on Mar. 27, 2019, as Appl. No. 16/365,891.
Int. Cl. G06F 21/62 (2013.01); G06F 16/245 (2019.01); G06F 16/28 (2019.01)

CPC G06F 21/6245 (2013.01) [G06F 16/245 (2019.01); G06F 16/285 (2019.01)]

14 Claims

1. A method of identifying sensitive data in clear text comprising:

receiving, at a processor, clear text data;

representing, by the processor, at least a portion of the clear text data as at least one array encoding a description of at least one feature of the clear text data;

processing, by the processor, the at least one array using a clustering algorithm to determine whether the at least one array is grouped with a benign cluster or a sensitive cluster of a model, wherein processing the at least one array further includes receiving the at least one array as input and setting a corresponding Boolean value to at least one feature included in the at least one array, and wherein the at least one feature includes characteristics known to be associated with the sensitive cluster; and

in response to determining that the at least one array is grouped with the sensitive cluster, generating, by the processor, an alert indicating that the clear text data includes sensitive information.
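
As an informal illustration only, and not the patented implementation, the steps of claim 1 can be sketched in Python. Everything specific here is an assumption: the five Boolean features, the 3.5-bit entropy threshold, the two centroid vectors standing in for a previously fitted clustering model, and the names `to_feature_array`, `assign_cluster`, and `scan`. The sketch covers only the assignment of a new array to the nearest cluster centroid by Euclidean distance; the claim does not name a specific clustering algorithm.

```python
import math
from collections import Counter
from typing import List, Optional


def shannon_entropy(text: str) -> float:
    """Shannon entropy of the character distribution, in bits per character."""
    if not text:
        return 0.0
    counts = Counter(text)
    n = len(text)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def to_feature_array(text: str) -> List[bool]:
    """Represent clear text as an array of Boolean features.

    The features are hypothetical examples of characteristics that might be
    associated with secrets; the claim does not enumerate them.
    """
    return [
        len(text) >= 16,                                   # long token
        any(ch.isdigit() for ch in text),                  # contains digits
        any(ch.isupper() for ch in text)
        and any(ch.islower() for ch in text),              # mixed case
        " " not in text,                                   # no spaces
        shannon_entropy(text) > 3.5,                       # high entropy (assumed threshold)
    ]


# Hypothetical centroids of a previously fitted two-cluster model.
MODEL = {
    "benign": [0.6, 0.2, 0.3, 0.1, 0.2],
    "sensitive": [0.9, 0.9, 0.8, 1.0, 0.9],
}


def assign_cluster(features: List[bool]) -> str:
    """Return the name of the cluster whose centroid is nearest to the array."""
    point = [float(f) for f in features]
    return min(MODEL, key=lambda name: math.dist(point, MODEL[name]))


def scan(text: str) -> Optional[str]:
    """Return an alert message if the text groups with the sensitive cluster."""
    if assign_cluster(to_feature_array(text)) == "sensitive":
        return "ALERT: clear text data appears to include sensitive information"
    return None


if __name__ == "__main__":
    for sample in ["the meeting is at noon", "q8Zr2LmX9vTb4KpW7sNd"]:
        print(repr(sample), "->", scan(sample))
```

In this sketch an ordinary sentence falls nearer the benign centroid and produces no alert, while a long, space-free, mixed-case token with digits falls nearer the sensitive centroid and triggers one. In practice the centroids would be produced by fitting a clustering algorithm to labeled or curated examples rather than written by hand.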