CPC G06N 5/048 (2013.01) [G06N 7/01 (2023.01); G06N 20/00 (2019.01); H04L 51/212 (2022.05); H04L 63/126 (2013.01); H04L 63/1466 (2013.01); H04L 63/0236 (2013.01); H04L 63/0245 (2013.01); H04L 63/1425 (2013.01)] | 20 Claims |
1. A computer-implemented method for determining if an electronic account identifier is computer-generated, comprising:
receiving the electronic account identifier;
analyzing the electronic account identifier to determine a plurality of fragments comprising hashed or truncated identifier fragments of a predetermined character length range;
comparing the plurality of fragments to determine one or more alphanumeric features of at least one fragment;
comparing the at least one fragment with a second plurality of fragments associated with a plurality of electronic account identifiers to determine a percentile of commonness of the at least one fragment;
determining, by a computer, if the electronic account identifier is computer-generated beyond a predetermined confidence threshold based on the determined one or more features of the at least one fragment;
transmitting the determined computer generated electronic account identifier to a probabilistic classifier model, wherein the probabilistic classifier model is a trained machine learning system; and
training the probabilistic classifier model further based on the transmitted electronic account identifier.
|