US 12,277,105 B2
Methods and systems for improved search for data loss prevention
Hans-Joachim Lotzer, Bern (CH); and Klaus Gerhard Haller, Bern (CH)
Assigned to SWISSCOM AG, Bern (CH)
Filed by Swisscom AG, Bern (CH)
Filed on Mar. 6, 2023, as Appl. No. 18/117,826.
Application 18/117,826 is a continuation of application No. 15/723,883, filed on Oct. 3, 2017, granted, now Pat. No. 11,609,897.
Claims priority of application No. 1309/16 (CH), filed on Oct. 3, 2016.
Prior Publication US 2023/0205755 A1, Jun. 29, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/23 (2019.01); G06F 16/2455 (2019.01); G06F 21/62 (2013.01)
CPC G06F 16/2365 (2019.01) [G06F 16/24553 (2019.01); G06F 21/6227 (2013.01); G06F 21/6245 (2013.01)]
20 Claims
 
1. A method comprising:
applying data loss prevention to data that comprises a plurality of records and a plurality of categories, wherein each of the plurality of records comprises a plurality of fields and each of the plurality of fields corresponds to a different one of the plurality of categories, and wherein the applying of data loss prevention comprises:
  selecting a subset of records from the plurality of records of the data, the selected subset of records comprising fewer records than the plurality of records;
  scanning fields of the selected subset of records for sensitive information;
  computing based on a result of the scanning, for each category, a likelihood the category contains the sensitive information, wherein the computing comprises, for at least one category:
    computing likelihoods of different types of sensitive information contained in the at least one category; and
    combining the computed likelihoods to generate a likelihood to contain any sensitive information in the at least one category;
  selecting a subset of categories based on the threshold and the computed likelihoods of the categories to contain the sensitive information, the subset of categories comprising fewer categories than the plurality of categories;
  searching the sensitive information in the selected subset of categories; and
  in response to detection of sensitive information in at least one of the selected subset of records, taking one or more data loss prevention related actions.
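
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`DETECTORS`, `combine`, `category_likelihoods`, `select_categories`, `search_and_redact`) is hypothetical, the two regular expressions stand in for whatever detectors a real system would use, and the noisy-OR formula used to combine per-type likelihoods is an assumption, since the claim does not fix a particular combination rule. Redaction is likewise only one possible data loss prevention related action.

```python
import random
import re

# Hypothetical detectors, one regular expression per type of sensitive information.
DETECTORS = {
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[a-z]{2,}", re.IGNORECASE),
    "phone": re.compile(r"\+?\d[\d \-]{7,}\d"),
}


def combine(type_likelihoods):
    """Noisy-OR combination: 1 - prod(1 - p). An assumed rule, not from the claim."""
    miss = 1.0
    for p in type_likelihoods:
        miss *= 1.0 - p
    return 1.0 - miss


def category_likelihoods(records, sample_size, seed=0):
    """Scan a random subset of records and estimate one likelihood per category.

    Assumes sample_size >= 1 and that every record has the same keys.
    """
    sample = random.Random(seed).sample(records, min(sample_size, len(records)))
    likelihoods = {}
    for category in records[0]:
        # Fraction of sampled fields matched by each detector type.
        per_type = [
            sum(1 for rec in sample if pattern.search(str(rec[category]))) / len(sample)
            for pattern in DETECTORS.values()
        ]
        likelihoods[category] = combine(per_type)
    return likelihoods


def select_categories(likelihoods, threshold):
    """Keep only categories whose likelihood reaches the threshold."""
    return [c for c, p in likelihoods.items() if p >= threshold]


def search_and_redact(records, categories):
    """Search all records, but only in the selected categories.

    Redacts each matching field in place and returns (index, category, type).
    """
    findings = []
    for index, rec in enumerate(records):
        for category in categories:
            for name, pattern in DETECTORS.items():
                if pattern.search(str(rec[category])):
                    findings.append((index, category, name))
                    rec[category] = "[REDACTED]"
                    break
    return findings


if __name__ == "__main__":
    data = [
        {"id": str(i), "contact": f"user{i}@example.com", "note": "ok"}
        for i in range(10)
    ]
    scores = category_likelihoods(data, sample_size=4)
    selected = select_categories(scores, threshold=0.5)
    print(selected, len(search_and_redact(data, selected)))
```

In this sketch only a sample of records is scanned in full, and the later exhaustive search is restricted to the categories that the sample flags. The cost of the full pass therefore scales with the number of selected categories rather than with all of them.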