| CPC G06F 8/38 (2013.01) [G06F 3/0484 (2013.01); G06F 8/33 (2013.01); G06F 40/40 (2020.01); G06N 3/0455 (2023.01)] | 15 Claims |

|
1. A computer-implemented method, comprising:
receiving a training dataset comprising a plurality of training samples, each training sample comprising:
a graphical user interface that includes a plurality of graphical elements; and
a natural language textual description comprising a single phrase that describes the graphical user interface that includes the plurality of graphical elements;
generating, for each graphical user interface, graphical attribute data that describes, for each graphical element of the graphical user interface, an attribute type of the graphical element, and a position of the graphical element;
generating, for each natural language textual description, using a pre-trained word embedding model, an embedding vector of the natural language textual description; and
training a machine learning model, based on the graphical attribute data and the embedding vector for each training sample, to generate, as output, prediction data that is indicative of graphical elements in a graphical user interface, wherein the machine learning model comprises a transformer based model that includes:
an encoder that receives the embedding vector of the natural language textual description and processes the embedding vector to generate an output vector; and
a decoder that receives the output vector and the graphical attribute data generated from the training sample and is trained to generate the prediction data.
|