CPC G06N 3/08 (2013.01) [G06N 3/045 (2023.01); G10L 15/16 (2013.01)] | 26 Claims |
1. A computer-implemented method, the method comprising:
obtaining, by one or more computing devices, a machine-learned model that has been previously trained on a first training dataset to perform a first task, the machine-learned model including a first set of learnable parameters;
modifying, by the one or more computing devices, the machine-learned model to include a model patch, the model patch including a second set of learnable parameters, wherein the machine-learned model comprises a plurality of layers, and at least some the second set of learnable parameters included in the model patch comprise one or both of scale and bias parameters for one or more layers of the plurality of layers; and
after modifying the machine-learned model to include the model patch, training, by the one or more computing devices, the machine-learned model on a second training dataset to perform a second task that is different from the first task, wherein training, by the one or more computing devices, the machine-learned model on the second training dataset to perform the second task comprises learning new values for the second set of learnable parameters included in the model patch.
|