| CPC G06N 3/08 (2013.01) | 20 Claims |

|
1. A method of generating an output sequence comprising a plurality of outputs from an input sequence comprising a plurality of inputs, the method comprising:
receiving the input sequence;
processing the input sequence using a latent prediction model configured to autoregressively predict a sequence of discrete latent variables that is shorter than the output sequence, wherein each discrete latent variable in the sequence of discrete latent variables is selected from a discrete set of latent variables, and wherein the latent prediction model is configured to, for each discrete latent variable in the sequence:
select the discrete latent variable from the discrete set of latent variables conditioned on the input sequence and on any discrete latent variables in the sequence that have already been generated; and
processing the input sequence and the predicted sequence of discrete latent variables using a parallel decoder model configured to generate the outputs in the output sequence in parallel from the input sequence and the predicted sequence of discrete latent variables, comprising:
processing, using a first deep neural network within the parallel decoder model, the input sequence and the predicted sequence of discrete latent variables that is generated by the latent prediction model and that is shorter than the output sequence to generate a first sequence that has a same length as the output sequence; and
processing, using a decoder deep neural network, the first sequence that is generated from the input sequence and the predicted sequence of discrete latent variables to generate the output sequence in parallel from the first sequence in a single forward pass.
|