US 12,271,355 B2
Semantic classification for data management
Eyal Dahari, Ganei Tikva (IL); and Michal Barak, Petach Tikva (IL)
Assigned to BMC Software, Inc., Houston, TX (US)
Filed by BMC Software, Inc., Houston, TX (US)
Filed on Sep. 30, 2022, as Appl. No. 17/937,261.
Prior Publication US 2024/0111736 A1, Apr. 4, 2024
Int. Cl. G06F 16/00 (2019.01); G06F 16/21 (2019.01); G06F 16/215 (2019.01); G06F 40/30 (2020.01)
CPC G06F 16/215 (2019.01) [G06F 16/211 (2019.01); G06F 40/30 (2020.01)] 20 Claims
OG exemplary drawing
 
1. A computer program product, the computer program product being tangibly embodied on a non-transitory computer-readable storage medium and comprising instructions that, when executed by at least one computing device, are configured to cause the at least one computing device to:
receive data having a plurality of field names, each field name associated with a plurality of field values;
access a semantic dictionary that associates each semantic type of a plurality of semantic types with a field value pattern and at least one known field name;
relate, for a selected field name of the plurality of field names, corresponding field values to each field value pattern of the semantic dictionary to determine whether the corresponding field values match any field value pattern of the semantic dictionary;
generate, when the corresponding field values do not match any field value pattern of the semantic dictionary, a candidate field value pattern based on a candidate field value of the corresponding field values;
determine that a threshold percentage of the corresponding field values conform to the candidate field value pattern; and
update the semantic dictionary with a new semantic type that is associated with the candidate field value pattern and the selected field name.