| CPC G06F 8/35 (2013.01) [G06N 3/084 (2013.01); G06N 3/048 (2023.01)] | 20 Claims |

|
1. A system comprising:
a processor; and
a memory that stores a program configured to be executed by the processor, the program comprises instructions that when executed perform acts that:
obtain, in a local execution environment, a pre-trained deep learning model trained to perform a first source code generation task, wherein the pre-trained deep learning model comprises a plurality of transformer blocks, wherein the plurality of transformer blocks comprises a plurality of model parameters;
receive, from a remote execution environment, tuning data to tune the pre-trained deep learning model to perform a second source code generation task, wherein the tuning data comprises a plurality of trainable parameters and source code samples, wherein the plurality of trainable parameters is separate from the plurality of model parameters;
tune, in the local execution environment, the pre-trained deep learning model to learn to perform the second source code generation task through application of the tuning data to the plurality of transformer blocks, wherein the plurality of model parameters is kept frozen and the plurality of trainable parameters are updated in the plurality of transformer blocks, wherein the local execution environment differs from the remote execution environment;
output hidden states from a last transformer block of the plurality of transformer blocks to the remote execution environment;
receive from the remote execution environment an error loss obtained from the outputted hidden states; and
update the plurality of trainable parameters in each of the plurality of transformer blocks based on the error loss.
|