| CPC G06N 3/045 (2023.01) [G06F 17/16 (2013.01); G06N 3/08 (2013.01)] | 20 Claims |

|
1. A method performed by one or more computers, the method comprising:
processing inputs using a sequence of deep neural networks (DNNs),
wherein each DNN in the sequence of DNNs has been trained to perform a respective machine learning task of a sequence of machine learning tasks, wherein the sequence of DNNs comprises:
a first DNN that corresponds to a first machine learning task of the sequence of machine learning tasks, wherein
(i) the first DNN comprises a first plurality of indexed layers, and
(ii) each layer in the first plurality of indexed layers is configured to receive a respective layer input and process the respective layer input to generate a respective layer output; and
one or more subsequent DNNs corresponding to one or more respective subsequent machine learning tasks of the sequence of machine learning tasks, wherein
(i) each subsequent DNN comprises a respective plurality of indexed layers, and
(ii) each layer in a respective plurality of indexed layers with index i greater than one receives input from
(i) a preceding layer of the respective subsequent DNN, and
(ii) one or more preceding layers of respective preceding DNNs through respective outputs of respective non-linear lateral connections, wherein a preceding layer is a layer whose index is one less than the index i, and wherein the respective non-linear lateral connections represent a learned, non-linear transformation of the respective layer outputs of the one or more preceding layers of the respective preceding DNNs; and
(iii) each layer in the respective plurality of indexed layers with index i greater than one:
generates a respective activation by processing (i) the input received from the preceding layer of the respective subsequent DNN and (ii) the respective outputs of each of the respective non-linear lateral connections applied to the respective layer outputs of the one or more preceding layers of the respective preceding DNNs;
wherein processing the inputs comprises:
processing a first input for a last machine learning task of the sequence of machine learning task;
processing the first input using the respective DNNs of the sequence of DNNs, and
using a last subsequent DNN in the sequence to generate a last subsequent DNN output for performing the last machine learning task.
|