CPC G06N 3/08 (2013.01) [G06N 3/02 (2013.01); G06N 3/04 (2013.01); G06N 3/045 (2023.01); G10L 15/16 (2013.01)] | 40 Claims |
1. A method of improving the quality of an attention-based sequence-to-sequence model, the method comprising:
determining an output sequence corresponding to an input sequence based on an attention-based sequence-to-sequence model;
selecting at least one target attention head from among a plurality of attention heads each configured to generate a respective attention weight matrix;
detecting at least one error output token among output tokens constituting the output sequence based on the target attention head; and
correcting the output sequence based on the at least one error output token.
|
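The four steps of claim 1 can be illustrated with a minimal sketch. The claim does not say how the target head is selected, how an error token is detected, or how the output is corrected, so everything specific below is an assumption made only for illustration: the target head is the one whose attention peaks move most monotonically through the input, an output token is flagged when its peak moves backwards or jumps more than `max_jump` input positions, and correction simply drops the flagged tokens. All function names, the threshold, and the toy attention matrices are hypothetical.

```python
from typing import List, Sequence

# One attention weight matrix per head: rows are output tokens, columns are input positions.
Matrix = List[List[float]]


def argmax(row: Sequence[float]) -> int:
    """Index of the largest attention weight in a row (first index on ties)."""
    return max(range(len(row)), key=row.__getitem__)


def monotonicity_score(attn: Matrix) -> float:
    """Fraction of consecutive output steps whose attention peak does not move backwards.

    Assumption: a monotonic alignment is the 'expected' shape of a healthy head.
    """
    peaks = [argmax(row) for row in attn]
    if len(peaks) < 2:
        return 1.0
    ok = sum(1 for a, b in zip(peaks, peaks[1:]) if b >= a)
    return ok / (len(peaks) - 1)


def select_target_head(heads: Sequence[Matrix]) -> int:
    """Pick the head with the highest monotonicity score (assumed selection rule)."""
    return max(range(len(heads)), key=lambda h: monotonicity_score(heads[h]))


def detect_error_tokens(attn: Matrix, max_jump: int = 2) -> List[int]:
    """Flag output steps whose peak moves backwards or jumps too far ahead.

    The comparison is against the last accepted peak, so one bad step does not
    cause the following steps to be flagged as well.
    """
    if not attn:
        return []
    errors = []
    last = argmax(attn[0])
    for t in range(1, len(attn)):
        peak = argmax(attn[t])
        if peak < last or peak - last > max_jump:
            errors.append(t)
        else:
            last = peak
    return errors


def correct_output(tokens: Sequence[str], errors: Sequence[int]) -> List[str]:
    """Drop the flagged tokens (assumed correction; replacement would also fit the claim)."""
    bad = set(errors)
    return [tok for i, tok in enumerate(tokens) if i not in bad]


# Toy data: 4 output tokens attending over 4 input positions, 2 heads.
tokens = ["the", "cat", "cat", "sat"]
head_a = [  # scattered peaks: 3, 0, 2, 1
    [0.1, 0.2, 0.3, 0.4],
    [0.4, 0.3, 0.2, 0.1],
    [0.1, 0.2, 0.4, 0.3],
    [0.2, 0.4, 0.3, 0.1],
]
head_b = [  # mostly monotonic peaks: 0, 1, 0, 3 (third token looks back)
    [0.9, 0.1, 0.0, 0.0],
    [0.1, 0.8, 0.1, 0.0],
    [0.7, 0.2, 0.1, 0.0],
    [0.0, 0.1, 0.2, 0.7],
]
heads = [head_a, head_b]

target = select_target_head(heads)
errors = detect_error_tokens(heads[target])
corrected = correct_output(tokens, errors)
print(target, errors, corrected)
```

In this toy case the second head is selected, the repeated third token is flagged because its attention peak returns to an earlier input position, and the corrected sequence omits it. A real system would take the attention weight matrices from the decoder of the trained model rather than from hand-written lists.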