CPC G06F 11/3664 (2013.01) [G06F 11/368 (2013.01); G06F 11/3688 (2013.01); G06N 3/08 (2013.01); G06N 3/10 (2013.01)] | 20 Claims |
1. A method of automated software testing comprising:
selecting, by a reinforcement-learning model, a first action to be taken in a first user interface of a first software instance running in a first live environment;
causing the first action to be performed, wherein the first action includes interacting with a first interface element in the first user interface;
determining a first reward associated with the first action;
selecting, by the reinforcement-learning model, a second action to be taken in the first user interface of a second software instance running in a second live environment;
causing the second action to be performed, wherein the second action includes interacting with a second interface element in the first user interface;
determining a second reward associated with the second action;
generating an updated reinforcement-learning model by training the reinforcement-learning model using the first action, the second action, the first reward, and the second reward; and
storing the updated reinforcement-learning model.
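The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the claim does not specify a learning algorithm, a reward definition, or a storage format. The sketch assumes a tabular Q-learning update, a reward of 1.0 for an interface element not previously reached in that instance, and pickle-based storage. The names `ToyUI`, `QModel`, `run_round`, and `save_model` are hypothetical, and `ToyUI` is an in-memory stand-in for a software instance running in a live environment.

```python
import pickle
import random
from collections import defaultdict


class ToyUI:
    """Hypothetical stand-in for one software instance in a live environment."""

    def __init__(self):
        self.screen = "home"   # current UI state
        self.visited = set()   # interface elements already interacted with

    def state(self):
        return self.screen

    def interact(self, element):
        """Perform the action; assumed reward is 1.0 for a newly reached element."""
        reward = 0.0 if element in self.visited else 1.0
        self.visited.add(element)
        self.screen = element
        return reward


class QModel:
    """Tabular Q-learning model (an assumed choice, not taken from the claim)."""

    def __init__(self, epsilon=0.1, alpha=0.5, gamma=0.9, seed=0):
        self.q = defaultdict(float)
        self.epsilon, self.alpha, self.gamma = epsilon, alpha, gamma
        self.rng = random.Random(seed)

    def select_action(self, state, actions):
        # Epsilon-greedy: explore with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(actions)
        return max(actions, key=lambda a: self.q[(state, a)])

    def train(self, experiences, actions):
        # Standard Q-learning update applied to each collected experience.
        for state, action, reward, next_state in experiences:
            best_next = max(self.q[(next_state, a)] for a in actions)
            target = reward + self.gamma * best_next
            self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


def run_round(model, env_a, env_b, actions):
    """Select and perform one action per instance, then train on both results."""
    experiences = []
    for env in (env_a, env_b):
        state = env.state()
        action = model.select_action(state, actions)  # selecting an action
        reward = env.interact(action)                 # performing it, determining a reward
        experiences.append((state, action, reward, env.state()))
    model.train(experiences, actions)                 # generating the updated model
    return experiences


def save_model(model, path):
    """Store the updated model's value table (pickle is an assumed format)."""
    with open(path, "wb") as f:
        pickle.dump(dict(model.q), f)
```

A round would be run as `run_round(QModel(), ToyUI(), ToyUI(), ["ok_button", "menu"])`, followed by `save_model(model, path)` for the storing step. Collecting experience from two instances before a single training call mirrors the claim's use of both actions and both rewards in one update.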