| CPC G10L 15/1822 (2013.01) [G06F 40/30 (2020.01); G06Q 30/0281 (2013.01); G10L 15/14 (2013.01); G10L 15/16 (2013.01); G06F 16/90332 (2019.01)] | 20 Claims |

|
1. A method of training neural network models to predict an intent of an utterance, the method comprising:
setting an encoder layer of a first neural network model to be trainable;
obtaining a subset of multi-word training utterances from among a first plurality of multi-word utterances, the first plurality of multi-word utterances including a plurality of multi-word utterances, from among a plurality of topic-tagged multi-word utterances, that are tagged with a first topic, from among a plurality of topics included in a topic set;
for each training utterance of the subset of multi-word training utterances,
inputting the training utterance into an input layer of the first neural network model to generate an embedding of the training utterance,
generating predicted intent values based on the embedding of the training utterance, the predicted intent values being a vector of generated probabilities, each of the generated probabilities being a probability that the training utterance corresponds to an intent of a plurality of intents,
determining a predicted intent of the training utterance based on the predicted intent values,
calculating an error value based on differences between the predicted intent values and training intent values, and
adjusting weights of a plurality of trainable layers of the first neural network model based on the calculated error values for each training utterance of the plurality of multi-word training utterances to reduce the calculated error values; and
training a second neural network model to predict an intent of a multi-word utterance based on second training data, the second training data corresponding to a second plurality of multi-word utterances, from among the plurality of topic-tagged multi-word utterances, that are tagged with a second topic from among the plurality of topics included in the topic set.
|