| CPC G06F 11/3636 (2013.01) [G06N 3/08 (2013.01); G06N 20/00 (2019.01)] | 18 Claims |

|
1. A system comprising:
one or more processors; and
a memory that stores one or more programs that are configured to be executed by the one or more processors, the one or more programs including instructions to perform actions that:
obtain a large language model trained on source code;
generate a first pre-training dataset including a first plurality of source code samples, each source code sample of the first plurality of source code samples paired with a corresponding code execution trace, wherein each of the source code samples of the first pre-training dataset having a single-line code execution, wherein the code execution trace represents dynamic behavior of the source code sample during execution of the source code sample;
train the large language model to learn to predict a code execution trace with the first pre-training dataset;
generate a second pre-training dataset including a second plurality of source code samples, each source code sample of the second plurality of source code samples paired with a corresponding code execution trace, wherein each of the source code samples of the second pre-training dataset having multiple-line code execution; and
train the large language model to learn to predict a code execution trace with the second pre-training dataset, wherein the large language model comprises a unified cross-modal neural transformer model with attention.
|