| CPC H04L 41/16 (2013.01) [H04L 41/12 (2013.01)] | 20 Claims |

|
1. A non-transitory computer-readable medium configured to store computer logic having instructions that, when executed, enable a processing device to perform the steps of:
acknowledging a plurality of subnetworks among a whole network, each subnetwork including a plurality of nodes and being represented by a tunnel group having a plurality of end-to-end tunnels through the respective subnetwork;
selecting a first group of subnetworks from the plurality of subnetworks;
generating a Reinforcement Learning (RL) agent for each subnetwork of the first group, each RL agent based on observations of end-to-end metrics of the end-to-end tunnels of the respective subnetwork, the observations being independent of specific topology information of the respective subnetwork;
training a global model based on the RL agents of the first group of subnetworks; and
applying the global model to an Action Recommendation Engine (ARE) configured for recommending actions that can be taken to improve a state of the whole network.
|