| CPC H04L 63/1416 (2013.01) [G06F 16/2379 (2019.01); G06F 16/245 (2019.01)] | 20 Claims |

|
1. A method comprising:
receiving, by at least one server, a plurality of log files from a plurality of data sources, wherein each of the plurality of log files contains a domain name field;
identifying, by the at least one server, non-identical yet equivalent domain names that each include an identical consecutive character pattern;
in response to identifying at least two non-identical yet equivalent domain names parsed from data fields of the plurality of log files that include the identical consecutive character pattern,
replacing one or more of the at least two non-identical yet equivalent domain names with one or more replacement unique domain names;
identifying, by the at least one server, one or more unique domain names and eliminating one or more duplicative domain names,
wherein the one or more duplicative domain names include at least one of the one or more replacement unique domain names;
determining a credibility score for the one or more unique domain names based on a length of the one or more unique domain names and a comparison of the one or more unique domain names to a domain name corpus stored in a dictionary database; and
updating a blacklist record to include the one or more unique domain names based on the credibility score.
|
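For illustration only, the steps recited in claim 1 can be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation. The claim does not define how equivalence is determined, how the credibility score is computed, or what threshold governs the blacklist update. The sketch assumes that two names are equivalent when they match after lowercasing and after removing a trailing dot and a leading "www." label. The function names, the `domain` record key, the equal weighting of the length and corpus terms, `max_len`, and `threshold` are all hypothetical.

```python
from typing import Iterable, List, Mapping, Set


def canonicalize(name: str) -> str:
    """Map equivalent spellings to one replacement name (assumed rule)."""
    name = name.strip().lower().rstrip(".")
    if name.startswith("www."):
        name = name[len("www."):]
    return name


def unique_domains(log_files: Iterable[Iterable[Mapping[str, str]]]) -> List[str]:
    """Parse the domain field of every record and drop duplicates."""
    seen: Set[str] = set()
    ordered: List[str] = []
    for records in log_files:
        for record in records:
            raw = record.get("domain")  # hypothetical field name
            if not raw:
                continue
            replacement = canonicalize(raw)
            if replacement not in seen:
                seen.add(replacement)
                ordered.append(replacement)
    return ordered


def credibility_score(domain: str, corpus: Set[str], max_len: int = 30) -> float:
    """Hypothetical score in [0, 1]: shorter, corpus-listed names score higher."""
    length_part = max(0.0, 1.0 - len(domain) / max_len)
    corpus_part = 1.0 if domain in corpus else 0.0
    return 0.5 * length_part + 0.5 * corpus_part


def update_blacklist(blacklist: Set[str], domains: Iterable[str],
                     corpus: Set[str], threshold: float = 0.4) -> Set[str]:
    """Add every domain whose score falls below the assumed threshold."""
    for domain in domains:
        if credibility_score(domain, corpus) < threshold:
            blacklist.add(domain)
    return blacklist


if __name__ == "__main__":
    logs = [
        [{"domain": "WWW.Example.com."}, {"domain": "example.com"}],
        [{"domain": "xj3k9qpl2vbn7zt4mw8c.info"}],
    ]
    corpus = {"example.com"}  # stand-in for the dictionary database
    domains = unique_domains(logs)
    print(domains)
    print(update_blacklist(set(), domains, corpus))
```

In this sketch the two spellings of `example.com` collapse to one entry before scoring, so a duplicate is never scored or blacklisted twice. The scoring formula is a placeholder chosen only because the claim names length and corpus comparison as its two inputs.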