CPC G06F 40/44 (2020.01) [G06N 3/044 (2023.01); G06N 3/08 (2013.01); G06N 5/04 (2013.01)] | 20 Claims |
1. A method performed by one or more computers, the method comprising:
receiving a network input; and
generating a network output that represents an output image from the network input, wherein the network output comprises a plurality of outputs from a vocabulary of outputs arranged according to an output order, each of the plurality of outputs corresponding to a respective location in the output image, the generating comprising, at each of a plurality of generation time steps:
identifying a current partial network output that has already been generated as of the generation time step;
generating, using a decoder neural network conditioned on (i) at least a portion of the network input and (ii) the current partial network output, a decoder output that defines, for each of a plurality of insertion locations, a respective score distribution over the vocabulary of outputs, wherein each insertion location is a different new location in the output image at which there is no output in the current partial network output;
selecting, using the decoder output, one or more of the insertion locations and, for each selected insertion location, an inserted output from the vocabulary; and
generating a new partial network output that comprises (i) the current partial network output and (ii) for each selected insertion location, the inserted output from the vocabulary inserted at the corresponding new location in the output image.
|