CPC G06N 5/04 (2013.01) [G06N 20/00 (2019.01)] | 14 Claims |
1. A computerized method of training a computer-executed model for recognizing at least one numerical quantity, the method carried out by one or more processors, the method comprising:
receiving, as input, at least one unit expression;
searching for numeric values and the at least one unit expression in a text corpus, the text corpus comprising sets of words and frequency of occurrence of each of the sets, the search resulting in identification of sets that comprise a combination of a numeric value and the at least one unit expression;
generating sentences from the text corpus by applying the identified sets as input;
generating a training dataset by auto labelling the identified sets within the generated sentences based on the at least one numerical quantity; and
training the model by providing input based on the training dataset.
|
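The data-preparation steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the text corpus is a dictionary mapping n-grams to frequency counts, that units arrive as a hypothetical mapping from unit expression to quantity name, that sentence generation can be stood in for by a fixed carrier sentence, and that labels follow a BIO scheme. All function names, the `min_count` parameter, and the sample data are invented for illustration. The final training step is omitted because it depends on the chosen model.

```python
import re

# A numeric token such as "5" or "3.5" (assumption: plain decimal numbers only).
NUMBER = re.compile(r"\d+(?:\.\d+)?")


def find_quantity_ngrams(ngram_counts, units, min_count=1):
    """Return n-grams in which a numeric token is directly followed by a known unit."""
    hits = []
    for ngram, count in ngram_counts.items():
        if count < min_count:
            continue  # hypothetical frequency filter, not stated in the claim
        tokens = ngram.split()
        for i in range(len(tokens) - 1):
            if NUMBER.fullmatch(tokens[i]) and tokens[i + 1] in units:
                hits.append(ngram)
                break
    return hits


def generate_sentence(ngram):
    """Stand-in generator: embed the n-gram in a fixed carrier sentence."""
    return f"Reportedly, {ngram} ."


def auto_label(sentence, units):
    """Tag each number-plus-unit pair with BIO labels named after its quantity."""
    tokens = sentence.split()
    labels = ["O"] * len(tokens)
    for i in range(len(tokens) - 1):
        if NUMBER.fullmatch(tokens[i]) and tokens[i + 1] in units:
            quantity = units[tokens[i + 1]]
            labels[i] = f"B-{quantity}"
            labels[i + 1] = f"I-{quantity}"
    return list(zip(tokens, labels))


def build_training_dataset(ngram_counts, units, min_count=1):
    """Chain the search, sentence generation, and auto-labelling steps."""
    identified = find_quantity_ngrams(ngram_counts, units, min_count)
    return [auto_label(generate_sentence(ngram), units) for ngram in identified]


# Invented sample data: n-grams with frequencies, and unit-to-quantity mapping.
ngram_counts = {"it weighs 5 kg": 12, "she ran 3.5 km": 7, "kg of rice": 40}
units = {"kg": "MASS", "km": "LENGTH"}
dataset = build_training_dataset(ngram_counts, units)
```

Here `dataset` holds one list of (token, label) pairs per generated sentence. The n-gram "kg of rice" is skipped because it contains a unit expression without a numeric value. The labelled pairs could then be passed to any sequence-labelling model as training input.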