| CPC H04W 24/10 (2013.01) [G06N 3/02 (2013.01); H01Q 1/125 (2013.01); H04L 41/12 (2013.01); H04W 24/02 (2013.01)] | 17 Claims |

|
1. A computer implemented method performed by a computer system for a telecommunications network, the method comprising:
accessing a network metrics repository to retrieve a baseline dataset from a baseline policy deployed in the telecommunications network for controlling a configurable parameter of the telecommunications network, wherein the baseline dataset comprises a plurality of key performance indicators (KPIs) that each have a continuous value, and a plurality of historical changes made to the configurable parameter;
training a policy model while offline the telecommunications network using the baseline dataset and inverse propensity score, pi, on the plurality of KPIs as inputs to output from the policy model a probability of actions for controlling the configurable parameter, wherein training the policy model while offline further comprises splitting the baseline dataset into a training dataset and a testing dataset and validating performance of the probability of actions of the policy model based on comparison with performance of the probability of actions of the testing dataset; and
deploying the trained policy model to a plurality of cells in the telecommunications network via a plurality of network nodes for controlling the configurable parameter of the telecommunications network.
|