US 11,941,147 B2
Detection of personally identifiable information
Victor De Vansa Vikramaratne, Sunnyvale, CA (US); and Kave Eshghi, Los Altos, CA (US)
Assigned to Box, Inc., Redwood City, CA (US)
Filed by Box, Inc., Redwood City, CA (US)
Filed on Aug. 31, 2021, as Appl. No. 17/463,372.
Prior Publication US 2023/0064482 A1, Mar. 2, 2023
Int. Cl. G06F 21/62 (2013.01)
CPC G06F 21/6245 (2013.01)
23 Claims
 
1. A method for detecting PII (personally identifiable information), the method comprising:
maintaining a plurality of PII detectors comprising first and second detector types, wherein the first detector type is different from the second detector type, and the second detector type incurs a greater computational cost than the first detector type when processing identical content;
determining whether to process a particular content object using at least the second detector type, by:
performing regular expression analysis using the first detector type by evaluating a regular expression against the particular content object;
determining whether the second detector type is to be used on the particular content object based on at least a result of performing the regular expression analysis using the first detector type; and
when it is determined that the second detector type is to be used on the particular content object, incurring the greater computational cost of the second detector type by performing, using the second detector type, content analysis that is different from the regular expression analysis, wherein the regular expression is updated based on at least a portion of the particular content object identified using the second detector type, and the portion of the particular content object is related to a location in the particular content object identified by the first detector type; and
when it is determined that the second detector type is not to be used, avoiding incurring the greater computational cost of the second detector type by avoiding invocation of the second detector type.
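
The gating flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (RegexDetector, ExpensiveDetector, detect_pii), the SSN-style pattern, the 40-character context window, and the capitalized-word heuristic standing in for the costlier content analysis are assumptions chosen for illustration only.

```python
import re
from dataclasses import dataclass, field


@dataclass
class RegexDetector:
    """First detector type (hypothetical): cheap regular expression analysis."""
    # Assumed starting pattern: a US SSN-like number.
    patterns: list = field(default_factory=lambda: [r"\b\d{3}-\d{2}-\d{4}\b"])

    def find(self, text):
        # Return (start, end) spans for every match of every pattern.
        return [m.span() for p in self.patterns for m in re.finditer(p, text)]

    def add_literal(self, value):
        # Update the regular expression set with an escaped literal string.
        pattern = re.escape(value)
        if pattern not in self.patterns:
            self.patterns.append(pattern)


class ExpensiveDetector:
    """Second detector type (hypothetical stand-in for costlier analysis)."""

    def __init__(self):
        self.calls = 0  # counts invocations so avoided cost is observable

    def analyze(self, text, spans, window=40):
        self.calls += 1
        portions = []
        for start, _end in spans:
            # Inspect only the text just before the location the regex found.
            words = text[max(0, start - window):start].split()
            for a, b in zip(words, words[1:]):
                a, b = a.strip(".,:;"), b.strip(".,:;")
                # Assumed heuristic: two adjacent capitalized words form a name.
                if a.istitle() and b.istitle():
                    portions.append(f"{a} {b}")
        return portions


def detect_pii(text, cheap, expensive):
    """Run the cheap detector first; invoke the expensive one only on a hit."""
    spans = cheap.find(text)
    if not spans:
        # Second detector is not invoked, so its cost is avoided.
        return {"used_second": False, "spans": [], "portions": []}
    portions = expensive.analyze(text, spans)
    for portion in portions:
        # Feed portions found near the regex hit back into the regex set.
        cheap.add_literal(portion)
    return {"used_second": True, "spans": spans, "portions": portions}
```

In this sketch, a document containing an SSN-like number triggers the second detector, and any name it finds near that location is added to the first detector's patterns, so later documents mentioning the same name are flagged by the cheap pass alone. A document with no regex hit never reaches the second detector.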