CPC G06F 16/243 (2019.01) [G06F 16/3329 (2019.01); G06F 16/35 (2019.01); G06N 20/00 (2019.01); G06F 16/374 (2019.01)] | 12 Claims |
1. A method comprising:
determining, via one or more processors, whether a dataset comprises unstructured text;
determining, via the one or more processors, that at least a portion of the unstructured text corresponds to a regex pattern of a regex list, wherein the regex pattern comprises at least one metacharacter, the at least one metacharacter associated with a non-literal meaning;
replacing, via the one or more processors, the portion of the unstructured text with an encoding that represents the regex pattern to generate a modified dataset; and
providing at least the modified dataset to at least one entity recognition system.
|
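The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The regex list, the encoding tokens (`<EMAIL>`, `<DATE>`, `<PHONE>`), the whitespace heuristic for detecting unstructured text, and the `recognizer` callable standing in for the entity recognition system are all hypothetical and chosen only for illustration.

```python
import re
from typing import Any, Callable, List

# Hypothetical regex list: each encoding token maps to one regex pattern.
# Every pattern uses metacharacters (\d, \b, +, {n}) that carry a
# non-literal meaning, as the claim requires.
REGEX_LIST = {
    "<EMAIL>": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "<DATE>": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
    "<PHONE>": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def is_unstructured_text(value: Any) -> bool:
    """Assumed heuristic: a string with internal whitespace is free text."""
    return isinstance(value, str) and any(ch.isspace() for ch in value.strip())


def encode_patterns(text: str) -> str:
    """Replace each portion matching a listed pattern with its encoding."""
    for token, pattern in REGEX_LIST.items():
        text = pattern.sub(token, text)
    return text


def preprocess(dataset: List[Any]) -> List[Any]:
    """Build the modified dataset; non-text records pass through unchanged."""
    return [
        encode_patterns(record) if is_unstructured_text(record) else record
        for record in dataset
    ]


def run_pipeline(dataset: List[Any], recognizer: Callable[[List[Any]], Any]) -> Any:
    """Provide the modified dataset to a (placeholder) entity recognizer."""
    return recognizer(preprocess(dataset))


dataset = [
    "Contact jane.doe@example.com before 2021-03-15 or call 555-123-4567.",
    42,
]
modified = preprocess(dataset)
```

In this sketch the matched portion is replaced by a token naming the pattern rather than by the matched value, so a downstream recognizer sees a uniform placeholder wherever an email address, date, or phone number appeared.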