CPC G06N 3/08 (2013.01) [G06F 17/18 (2013.01); G06F 40/44 (2020.01); G06F 40/51 (2020.01); G06N 3/045 (2023.01); G10L 15/16 (2013.01)] | 27 Claims |
1. A processor implemented method, comprising:
generating, using a first decoding model, a target sentence corresponding to a source sentence;
generating, using a second decoding model, words included in another target sentence in an order different from a generated word order of the words as also included in the target sentence;
generating, using the second decoding model, reward information associated with the target sentence and the words; and
training, based on the reward information, an updated sentence generation model, including resetting respective weights of nodes in the first decoding model,
wherein the first decoding model, second decoding model, and sentence generation model are machine learning models.
|