| CPC G06F 8/65 (2013.01) | 20 Claims |

|
1. A computer-implemented method performed by a software update distribution service, the method comprising:
receiving an indication of a selected objective to be pursued by a software-based reinforcement learning (RL) agent in performing actions relating to the updating of software applications installed on a set of client computing devices;
configuring the software-based RL agent to perform the actions subject to a randomly selected first schedule, wherein the actions comprise i) instructing an update agent executing at a client computing device to download a software update, and ii) instructing an update agent executing at a client computing device to install a previously downloaded software update;
invoking the software-based RL agent to perform a first software update procedure for a first software update by performing the actions subject to the randomly selected first schedule;
upon determining a change in a status, calculating a penalty associated with the software-based RL agent completing the first software update procedure for the first software update by performing the actions subject to the randomly selected first schedule;
comparing the penalty to a predetermined threshold; and
upon determining that the penalty exceeds the predetermined threshold, reconfiguring the software-based RL agent to perform actions subject to a second schedule, the actions relating to the updating of software applications executing on the set of client computing devices with a second software update.
|