US 11,657,168 B2
Error detection of data leakage in a data processing system
Ramesh Alagar, Tamil Nadu (IN); and Prabakar Rangarajan, Tamil Nadu (IN)
Assigned to Bank of America Corporation, Charlotte, NC (US)
Filed by Bank of America Corporation, Charlotte, NC (US)
Filed on Nov. 30, 2020, as Appl. No. 17/106,421.
Application 17/106,421 is a continuation of application No. 16/111,438, filed on Aug. 24, 2018, granted, now 10,853,509.
Prior Publication US 2021/0081554 A1, Mar. 18, 2021
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 21/00 (2013.01); G06F 21/62 (2013.01); G06F 16/28 (2019.01); G06F 16/951 (2019.01); G06F 16/23 (2019.01)
CPC G06F 21/62 (2013.01) [G06F 16/2365 (2019.01); G06F 16/285 (2019.01); G06F 16/951 (2019.01)] 20 Claims
 
1. A system for detecting data leakage, comprising:
a memory configured to store:
a plurality of allow tables, each allow table comprising a plurality of allowed data elements of a pre-defined data type that are associated with a plurality of users;
a web interceptor configured to:
intercept web traffic data associated with the plurality of users, the web traffic data comprising a plurality of data elements, each data element being associated with a pre-defined data type;
a clustering engine configured to:
group the plurality of data elements into a plurality of clusters, each cluster corresponding to a pre-defined data type; and
a leakage detection engine configured to:
identify a first cluster of data elements associated with a first pre-defined data type;
identify data elements in the first cluster that are associated with a first user;
identify a first allow table that is associated with the first pre-defined data type;
identify allowed data elements in the first allow table that are associated with the first user;
compare the identified data elements in the first cluster to the identified allowed data elements in the first allow table;
determine whether the identified data elements in the first cluster comprise at least one data element that is not in the identified allowed data elements in the first allow table; and
in response to determining that the identified data elements in the first cluster comprise at least one data element that is not in the identified allowed data elements in the first allow table, determine that an error of data leakage is detected for the first user within the first pre-defined data type.
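
The comparison recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each intercepted data element already carries its user and pre-defined data type, so the "clustering" step reduces to grouping by that type label, whereas the claimed clustering engine may work differently. The names `DataElement`, `cluster_by_type`, `detect_leakage`, and the sample users and values are all hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple


class DataElement(NamedTuple):
    user: str        # user the intercepted element is associated with
    data_type: str   # pre-defined data type, e.g. "email" (assumed label)
    value: str       # the data element itself


def cluster_by_type(elements):
    """Group intercepted data elements into one cluster per data type."""
    clusters = defaultdict(list)
    for element in elements:
        clusters[element.data_type].append(element)
    return dict(clusters)


def detect_leakage(clusters, allow_tables, data_type, user):
    """Return the user's elements in one cluster that are not in the allow table."""
    observed = {e.value for e in clusters.get(data_type, []) if e.user == user}
    allowed = allow_tables.get(data_type, {}).get(user, set())
    return observed - allowed


# Hypothetical allow tables: data type -> user -> set of allowed values.
allow_tables = {
    "email": {"alice": {"alice@example.com"}},
    "account": {"alice": {"1111"}},
}

# Hypothetical intercepted web traffic, already tagged by user and data type.
traffic = [
    DataElement("alice", "email", "alice@example.com"),
    DataElement("alice", "account", "1111"),
    DataElement("alice", "account", "9999"),
    DataElement("bob", "account", "2222"),
]

clusters = cluster_by_type(traffic)
leaked = detect_leakage(clusters, allow_tables, "account", "alice")
if leaked:
    # A non-empty difference corresponds to the claimed "error of data leakage".
    print(f"data leakage error for alice in 'account': {sorted(leaked)}")
```

Under these assumptions, the set difference between the observed and allowed elements plays the role of the claimed comparison: any element left over means a leakage error for that user within that data type.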