CPC G06F 8/35 (2013.01) [G06N 3/084 (2013.01); G06N 3/048 (2023.01)] | 20 Claims |
1. A system comprising:
a processor; and a memory that stores a program configured to be executed by the processor, the program including instructions that when executed perform acts that:
access a pre-trained deep learning model trained to generate source code, wherein the pre-trained deep learning model includes a plurality of model parameters, wherein the pre-trained deep learning model comprises an input layer, one or more transformer blocks and an output layer;
tune the pre-trained deep learning model to generate a first custom model through application of a first tuning dataset to the pre-trained deep learning model, wherein the first tuning dataset includes a first prefix that includes a plurality of trainable parameters distinct from the plurality of model parameters, wherein the tuning of the pre-trained deep learning model optimizes the plurality of trainable parameters with the plurality of model parameters frozen, wherein the input layer and the output layer are tuned in a first execution environment of a user space and the one or more transformer blocks are tuned in a second execution environment of a model space, wherein the first execution environment and the second execution environment differ; and
output the first custom model for deployment in an inference system to generate the source code.
|