| CPC H04L 63/1483 (2013.01) [G06F 16/9577 (2019.01); G06F 40/205 (2020.01); G06N 20/00 (2019.01); H04L 61/4511 (2022.05); H04L 63/1416 (2013.01); H04L 63/1425 (2013.01)] | 20 Claims |

|
1. A computing platform, comprising:
at least one processor;
a communication interface communicatively coupled to the at least one processor; and
memory storing computer-readable instructions that, when executed by the at least one processor, cause the computing platform to:
determine feature vectors corresponding to a tag structure of one or more pages associated with a first domain;
compare the feature vectors corresponding to the tag structure to the feature vectors corresponding to known legitimate domains of a baseline dataset, resulting in one or more structure analysis values comprising averages of top-N similarity scores for a plurality of selected N values; and
based on determining that the one or more structure analysis values exceed one or more predetermined structure analysis threshold values, send one or more commands directing a domain identification system to remove the first domain from a list of indeterminate domains maintained by the domain identification system.
|