| CPC G06F 21/6218 (2013.01) [G06F 16/9538 (2019.01)] | 20 Claims |

|
1. A method comprising:
storing a plurality of search keywords associated with an entity, wherein the plurality of search keywords comprises search keywords describing characteristics of the entity;
generating a plurality of search terms based on the plurality of search keywords, wherein each search term of the plurality of search terms comprises a different combination of one or more search keywords from the plurality of search keywords;
generating a set of search results by searching one or more online public databases based on each of the one or more search terms, wherein the set of search results comprises a set of data files stored by the one or more online public databases;
generating a risk score for each data file in the set of data files by applying a risk scoring model to the set of search results, wherein a risk score for a data file indicates a likelihood that the data file comprises information that came from a data breach associated with the entity;
identifying, based on the generated risk scores for the set of data files, one or more data files of the set of data files that contain information from the data breach; and
transmitting a notification to the entity describing the identified one or more data files.
|