CPC G06N 5/04 (2013.01) [G06F 40/20 (2020.01); G06N 3/045 (2023.01)] | 20 Claims |
1. A learning device comprising a processor configured to execute operations comprising:
determining scores based on similarity degrees between an input sentence and pieces of external knowledge stored in an external knowledge database;
selecting at least a part of the pieces of external knowledge based on the scores as search results;
acquiring an output to the input sentence based on predetermined arithmetic processing using the input sentence and the selected at least a part of the pieces of external knowledge as inputs;
determining, based on a combination of the input sentence, the acquired output, the selected pieces of external knowledge, and a true output given to the input sentence in advance, a reward, wherein the reward is based on a first index and a second index, the first index indicates correctness of the acquired output relative to the true output, and the second index indicates a quality of the selected at least a part of the pieces of external knowledge; and
learning a first neural network by updating parameters of the first neural network using the reward.
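The operations recited in claim 1 can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the claim: bag-of-words cosine similarity stands in for the similarity degrees, a single trainable scale `w` stands in for the first neural network, the hypothetical `reader` stands in for the predetermined arithmetic processing, exact match is used as the first index, the fraction of selected pieces containing the true output is used as the second index, and a REINFORCE-style policy-gradient step is used as the parameter update.

```python
import math
import random
from collections import Counter


def bow(text):
    """Bag-of-words counts; an assumed stand-in for a learned sentence encoder."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def reader(question, selected, candidates):
    """Hypothetical reader: returns the first candidate found in the selected
    knowledge. The question is accepted as an input but unused in this toy."""
    text = " ".join(selected).lower()
    for c in candidates:
        if c.lower() in text:
            return c
    return ""


def reward_fn(output, true_output, selected, weight=0.5):
    """Reward = first index (output correctness) + weight * second index
    (assumed retrieval quality: share of selected pieces containing the truth)."""
    first = 1.0 if output.strip().lower() == true_output.strip().lower() else 0.0
    second = sum(true_output.lower() in s.lower() for s in selected) / len(selected)
    return first + weight * second


def train(question, true_output, knowledge, candidates, steps=200, lr=0.1, seed=0):
    """Run REINFORCE-style updates on the single scale parameter w."""
    rng = random.Random(seed)
    w = 1.0  # assumed one-parameter stand-in for the first neural network
    q = bow(question)
    sims = [cosine(q, bow(k)) for k in knowledge]
    for _ in range(steps):
        probs = softmax([w * s for s in sims])  # scores turned into a selection policy
        i = rng.choices(range(len(knowledge)), weights=probs)[0]  # select one piece
        selected = [knowledge[i]]
        output = reader(question, selected, candidates)
        r = reward_fn(output, true_output, selected)
        # Gradient of log pi(i) with respect to w for a softmax over w * sims.
        grad = sims[i] - sum(p * s for p, s in zip(probs, sims))
        w += lr * r * grad
    return w


if __name__ == "__main__":
    kb = [
        "Paris is the capital of France",
        "Berlin is the capital of Germany",
        "The Seine is a river",
    ]
    print(train("what is the capital of France", "Paris", kb, ["Paris", "Berlin"]))
```

In this toy setting only the first knowledge piece yields a nonzero reward, so each rewarded update pushes `w` upward and sharpens the selection policy toward the most similar piece. A practical system would replace the scale parameter with an actual neural scorer and would likely select several pieces per input rather than one.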