| CPC G06F 11/3684 (2013.01) [G06F 11/3608 (2013.01); G06N 3/08 (2013.01)] | 17 Claims |

|
1. A computer-implemented method comprising:
receiving at least input data to a computer code and output data that the computer code produces corresponding to the input data;
based on the input data and the output data, building a decision tree that links input conditions to the output data;
using the input conditions, running a condition generator model created by fine-tuning a pre-trained programming language model, the condition generator model predicting a next condition that is likely to occur in the computer code;
based on the next condition, generating input instances and running the computer code using the generated input instances; and
based on output instances output by the computer code based on the generated input instances, detecting at least one flow path in the computer code, which had not been previously built in the decision tree,
wherein fine-tuning the pre-trained programming language model includes:
extracting input variable conditions from a source code sample;
using the extracted input variable conditions, creating a labeled dataset including sequences of conditions; and
fine-tuning at least a last layer of the pre-trained programming language model using the labeled dataset including the sequences of conditions to create the condition generator model.
|
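As an illustration of the dataset-creation steps recited in the wherein clause of claim 1 (extracting input variable conditions from a source code sample and forming labeled sequences of conditions), the following is a minimal sketch, not the claimed implementation. It assumes the source code sample is Python, treats every `if`/`while` test expression as a condition, and labels each prefix of the condition sequence with the condition that follows it. The names `extract_conditions`, `build_labeled_dataset`, and `SAMPLE` are hypothetical. The fine-tuning step itself is not shown.

```python
import ast

# Hypothetical source code sample; the claim does not prescribe a language.
SAMPLE = '''
def classify(x, y):
    if x > 10:
        if y < 0:
            return "a"
        return "b"
    elif x == 0:
        return "c"
    return "d"
'''


def extract_conditions(source: str) -> list[str]:
    """Return the test expressions of if/while statements in source order.

    Assumption: each branch test is treated as an "input variable condition".
    """
    tree = ast.parse(source)
    found = []
    for node in ast.walk(tree):
        if isinstance(node, (ast.If, ast.While)):
            # ast.unparse (Python 3.9+) turns the test expression back into text.
            found.append((node.lineno, node.col_offset, ast.unparse(node.test)))
    # ast.walk does not guarantee source order, so sort by position.
    return [text for _, _, text in sorted(found)]


def build_labeled_dataset(conditions: list[str]) -> list[tuple[list[str], str]]:
    """Pair each non-empty prefix of the sequence with the next condition as its label."""
    return [(conditions[:i], conditions[i]) for i in range(1, len(conditions))]


if __name__ == "__main__":
    conditions = extract_conditions(SAMPLE)
    for prefix, label in build_labeled_dataset(conditions):
        print(prefix, "->", label)
```

Each (prefix, label) pair could serve as one training example for next-condition prediction. How the sequences are actually tokenized and which layers are updated are left to the fine-tuning procedure and are outside this sketch.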