| CPC G06F 40/44 (2020.01) [G06F 40/30 (2020.01); G06N 3/045 (2023.01)] | 5 Claims |

|
1. An information learning apparatus comprising:
a memory and a processor configured to:
generate, for each of processing units constituting an input sequence included in training data, a third embedded vector by adding a second embedded vector corresponding to an unknown word to a first embedded vector of each processing unit regardless of whether said each processing unit represents the unknown word or not;
execute, using training data, a process based on a learning target parameter of a sequence-to-sequence model to generate an inference result, with the third embedded vector generated for said each processing unit as an input; and
learn the learning target parameter of the sequence-to-sequence model, wherein the learning target parameter is based on an error between the inference result of the execution and a ground truth output corresponding to the input sequence in the training data, wherein a learnt neural network with the learnt learning target parameter converts an input statement into one or more output words, and the input statement comprises the unknown word.
|