| CPC G06F 21/6254 (2013.01) [G06F 21/121 (2013.01); G06F 21/6263 (2013.01)] | 12 Claims |

|
1. A method for automatically removing one or more data items comprising personally identifiable information (“PII”) from a collection of data items comprising clickstream data electronically communicated, the method comprising the actions of:
receiving a raw data collection of data items by a first software program running on a privacy control system, the first software program comprising a rules-based intelligence system, the raw data collection of data items being generated by a content server that is interacting with a user or application;
said first software program applying it's rules based intelligence system to parse said raw data collection and detect, in said raw data collection a first PII indicator of a particular data item, the first PII indicator being a part of the raw data collection of data items but not being separate and distinct from the particular data item;
after detecting the first PII indicator, said first software program continuing to apply it's rules based intelligence system, parsing and detecting in said raw data collection said particular data item, wherein said particular data item is within a proximity threshold associated with said first PII indicator and said particular data item has a structure that matches a structure associated with said first PII indicator depicted by the rules based intelligence system;
removing, by said first software program, said particular data item from the raw data collection to form a cleaned data collection that does not include said particular data item but retains the first PII indicator;
providing, by said first software program, said cleaned data collection to an evaluation system to evaluate the cleaned data collection; whereby the PII is not at risk of exposure to the evaluation system;
communicating, by said first software program, to a fourth software program an identifier of said particular data item and at least one associated data item detected by said first software program within a predetermined proximity from said particular data item, said associated data item not being a particular data item, said associated data item not being an indicator; and
receiving, by said first software program from said fourth software program:
an updated particular proximity associating said particular data item with an indicator.
|