| CPC H04L 63/1483 (2013.01) [G06F 40/205 (2020.01); G06F 40/279 (2020.01); H04L 61/3005 (2013.01); H04L 61/4511 (2022.05)] | 20 Claims |

|
1. A method for domain processing, the method comprising:
for each respective candidate domain of a plurality of candidate domains:
comparing, by a computer, a seed domain and the respective candidate domain, the comparing including a character match count, a first string length of the seed domain, and a second string length of the respective candidate domain;
generating, by the computer, a similarity score based on the character match count, the first string length of the seed domain, and the second string length of the respective candidate domain;
computing, by the computer, a dynamic threshold based on the first string length of the seed domain and the second string length of the respective candidate domain;
determining, by the computer, whether the similarity score exceeds the dynamic threshold; and
responsive to the similarity score not exceeding the dynamic threshold, removing the respective candidate domain from the plurality of candidate domains.
|