| CPC G06F 16/215 (2019.01) [G06F 16/211 (2019.01); G06F 40/30 (2020.01)] | 20 Claims |

|
1. A computer program product, the computer program product being tangibly embodied on a non-transitory computer-readable storage medium and comprising instructions that, when executed by at least one computing device, are configured to cause the at least one computing device to:
receive data having a plurality of field names, each field name associated with a plurality of field values;
access a semantic dictionary that associates each semantic type of a plurality of semantic types with a field value pattern and at least one known field name;
relate, for a selected field name of the plurality of field names, corresponding field values to each field value pattern of the semantic dictionary to determine whether the corresponding field values match any field value pattern of the semantic dictionary;
generate, when the corresponding field values do not match any field value pattern of the semantic dictionary, a candidate field value pattern based on a candidate field value of the corresponding field values;
determine that a threshold percentage of the corresponding field values conform to the candidate field value pattern; and
update the semantic dictionary with a new semantic type that is associated with the candidate field value pattern and the selected field name.
|
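For illustration only, the steps recited in claim 1 can be sketched as below. This is a minimal sketch under stated assumptions, not the claimed implementation: the claim does not specify how patterns are represented, how a candidate pattern is derived, or what the threshold is. The sketch assumes patterns are regular expressions, that "match" against an existing pattern means meeting the same threshold, that the candidate field value is the first value, and that the threshold is 90%. The names `SemanticType`, `generalize`, `conforming_fraction`, and `learn_semantic_type` are hypothetical.

```python
import re
from dataclasses import dataclass, field


@dataclass
class SemanticType:
    """One semantic dictionary entry: a value pattern plus known field names."""
    name: str
    pattern: str  # assumption: the field value pattern is a regular expression
    known_field_names: set = field(default_factory=set)


def generalize(value: str) -> str:
    """Derive a candidate pattern from a single candidate field value.

    Assumption: ASCII digits become \\d, ASCII letters become [A-Za-z],
    and every other character is kept as an escaped literal.
    """
    parts = []
    for ch in value:
        if ch.isascii() and ch.isdigit():
            parts.append(r"\d")
        elif ch.isascii() and ch.isalpha():
            parts.append("[A-Za-z]")
        else:
            parts.append(re.escape(ch))
    return "".join(parts)


def conforming_fraction(values, pattern: str) -> float:
    """Fraction of values that fully match the pattern (0.0 for no values)."""
    if not values:
        return 0.0
    regex = re.compile(pattern)
    return sum(1 for v in values if regex.fullmatch(v)) / len(values)


def learn_semantic_type(dictionary, field_name, values, threshold=0.9):
    """Return the matching or newly added SemanticType, or None.

    `dictionary` is a list of SemanticType and is updated in place.
    """
    # Relate the field values to each existing field value pattern.
    for semantic_type in dictionary:
        if conforming_fraction(values, semantic_type.pattern) >= threshold:
            return semantic_type

    if not values:
        return None

    # No existing pattern matched: generate a candidate pattern from one
    # candidate field value (assumption: the first value is used).
    candidate_pattern = generalize(values[0])

    # Require a threshold percentage of the values to conform to it.
    if conforming_fraction(values, candidate_pattern) < threshold:
        return None

    # Update the dictionary with a new semantic type tied to this field name.
    new_type = SemanticType(
        name=f"type_for_{field_name}",  # hypothetical naming scheme
        pattern=candidate_pattern,
        known_field_names={field_name},
    )
    dictionary.append(new_type)
    return new_type


if __name__ == "__main__":
    dictionary = [SemanticType("zip_code", r"\d{5}", {"zip"})]
    part_numbers = ["AB-1234", "CD-5678", "EF-9012", "n/a"]
    learned = learn_semantic_type(dictionary, "part_no", part_numbers, threshold=0.75)
    print(learned)
```

In the usage example, none of the `part_no` values fit the existing five-digit pattern, so a candidate pattern is derived from `"AB-1234"`; three of the four values conform to it, which meets the 75% threshold chosen for the example, and a new entry is appended to the dictionary.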