| CPC G06F 18/2431 (2023.01) [G06F 16/285 (2019.01)] | 6 Claims |
|
1. A data-pattern classification method for a database in which character strings regularly overlap between input values of a mixed column and information input in another specific column, the mixed column including a plurality of types of information,
the method comprising:
extracting a branch column that changes the type of information to be input to the mixed column, the mixed column including two or more different types of information, the branch column being extracted by one of a heuristic first technique and a second technique, the first technique using the timing at which a pattern of the mixed column changes in the database including the mixed column, the pattern referring to a column whose character string overlaps the input value of the mixed column, the second technique using a statistical technique of a likelihood test; and
classifying, to obtain the number of types of information stored in the mixed column, by grouping the patterns obtained from the mixed column based on the extracted branch column, the grouping being performed according to the information indicated by the patterns,
wherein the branch column indicates a column for changing information about the input value of a remarks column according to an input character string,
wherein, if a first character string is input in the branch column, an input is made in the remarks column under a first rule; and
wherein, if a second character string is input in the branch column, an input is made in the remarks column under a second rule.
|
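The following is an illustrative sketch only, not the claimed implementation. It shows one way the second (statistical) technique and the grouping step of claim 1 could be realized. Several points are assumptions not found in the claim: the likelihood test is taken to be a likelihood-ratio (G) statistic for independence, the column names (`kind`, `region`, `name`, `phone`, `remarks`) and sample rows are hypothetical, and the candidate with the largest raw statistic is chosen without a significance threshold or degrees-of-freedom correction.

```python
import math
from collections import Counter

def row_pattern(row, mixed_col, pattern_cols):
    """Pattern of one row: the columns whose value occurs inside the mixed column."""
    text = row[mixed_col]
    return tuple(col for col in pattern_cols if row[col] and row[col] in text)

def g_statistic(pairs):
    """Likelihood-ratio (G) statistic for independence of the two items in each pair."""
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    # G = 2 * sum(observed * ln(observed / expected)), expected = left * right / n
    return 2.0 * sum(
        obs * math.log(obs * n / (left[a] * right[b]))
        for (a, b), obs in joint.items()
    )

def extract_branch_column(rows, mixed_col, pattern_cols, candidates):
    """Score each candidate by how strongly its value is tied to the pattern."""
    scores = {
        cand: g_statistic(
            [(r[cand], row_pattern(r, mixed_col, pattern_cols)) for r in rows]
        )
        for cand in candidates
    }
    return max(scores, key=scores.get), scores

def classify(rows, mixed_col, pattern_cols, branch_col):
    """Group branch-column values by the pattern they lead to."""
    groups = {}
    for r in rows:
        pattern = row_pattern(r, mixed_col, pattern_cols)
        groups.setdefault(pattern, set()).add(r[branch_col])
    return groups

# Hypothetical data: "remarks" is the mixed column; "kind" decides what goes in it.
rows = [
    {"kind": "person", "region": "east", "name": "Sato", "phone": "111", "remarks": "contact Sato"},
    {"kind": "person", "region": "west", "name": "Kato", "phone": "222", "remarks": "contact Kato"},
    {"kind": "company", "region": "east", "name": "Abe", "phone": "333", "remarks": "tel 333"},
    {"kind": "company", "region": "west", "name": "Ito", "phone": "444", "remarks": "tel 444"},
]
pattern_cols = ("name", "phone")

branch, scores = extract_branch_column(rows, "remarks", pattern_cols, ("kind", "region"))
groups = classify(rows, "remarks", pattern_cols, branch)
num_types = len(groups)  # number of types of information in the mixed column
```

In this toy data, the value of `kind` fully determines which column overlaps `remarks`, while `region` is unrelated to it, so `kind` is selected as the branch column and the two resulting groups correspond to the first and second input rules. On real data a raw G statistic favors columns with many distinct values, so a practical version would likely compare against a chi-squared threshold for the appropriate degrees of freedom.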