| CPC G06F 16/243 (2019.01) [G06F 16/3329 (2019.01); G06F 16/35 (2019.01); G06N 20/00 (2019.01); G06F 16/374 (2019.01)] | 20 Claims |

|
1. A device, comprising:
a memory; and
a processor coupled with the memory to:
determine a frequency of occurrences of a first regex pattern of a regex list in a dataset;
create a vector, the vector specifying a vector position and a regex value for at least the first regex pattern;
adjust the regex value corresponding to the first regex pattern by addition of a predefined value for each occurrence of the first regex pattern in the dataset;
detect a set of false matches between the first regex pattern of the regex list and the dataset;
adjust, based on the set of false matches, the regex list via a machine learning model or a classification model; and
provide the vector to at least one entity recognition system.
|
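The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. All names (`build_vector`, `find_false_matches`, `adjust_regex_list`, `PREDEFINED_VALUE`), the sample patterns and dataset, and the `known_false` labels are hypothetical. The claim calls for a machine learning model or a classification model to adjust the regex list; here a simple false-match-ratio threshold stands in for that model, and a plain list stands in for the entity recognition system.

```python
import re

PREDEFINED_VALUE = 1.0  # assumption: the claim does not fix the increment


def count_occurrences(pattern, dataset):
    """Frequency of occurrences of one regex pattern across the dataset."""
    return sum(1 for text in dataset for _ in re.finditer(pattern, text))


def build_vector(regex_list, dataset, increment=PREDEFINED_VALUE):
    """Return (vector position, regex value) pairs, one per pattern.

    Each value starts at 0 and grows by `increment` per occurrence.
    """
    vector = []
    for position, pattern in enumerate(regex_list):
        value = 0.0
        for _ in range(count_occurrences(pattern, dataset)):
            value += increment
        vector.append((position, value))
    return vector


def find_false_matches(pattern, dataset, known_false):
    """Matched strings that labeled data (hypothetical) marks as wrong."""
    return [
        m.group(0)
        for text in dataset
        for m in re.finditer(pattern, text)
        if m.group(0) in known_false
    ]


def adjust_regex_list(regex_list, dataset, known_false, max_false_ratio=0.5):
    """Stand-in for the claimed model: drop patterns that misfire too often."""
    kept = []
    for pattern in regex_list:
        total = count_occurrences(pattern, dataset)
        false = len(find_false_matches(pattern, dataset, known_false))
        if total == 0 or false / total <= max_false_ratio:
            kept.append(pattern)
    return kept


# Hypothetical inputs for illustration only.
regex_list = [r"\b\d{3}-\d{4}\b", r"\b[A-Z]{2}\d{2}\b"]
dataset = ["Call 555-1234 or 555-9876.", "Part AB12 ships with order 100-2000."]
known_false = {"100-2000"}  # an order number, not a phone number

vector = build_vector(regex_list, dataset)          # [(0, 3.0), (1, 1.0)]
adjusted = adjust_regex_list(regex_list, dataset, known_false, 0.25)

received = []            # stand-in for an entity recognition system
received.append(vector)  # "provide the vector"
```

In this sketch the first pattern matches three strings, one of which is labeled false, so a threshold of 0.25 removes it from the list while the default of 0.5 would keep it. A trained classifier could replace the threshold rule without changing the other steps.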