| CPC H04W 72/541 (2023.01) [G06N 3/006 (2013.01); G06N 20/00 (2019.01); H04B 7/1851 (2013.01); H04B 7/18539 (2013.01); H04W 72/0453 (2013.01)] | 14 Claims |

|
8. A frequency resource allocation method executed by a processor of a computing system comprising the processor that electrically communicates with a reinforcement learning model trained based on a reinforcement learning technique with a modeling for resources allocation process to determine resources to be allocated for transmitting a signal to a user within a target satellite network, the method comprising:
controlling the reinforcement learning model to output an action:
selecting resources for transmitting the signal to the user based on the action output by the reinforcement learning model;
allocating the selected resources to the user;
determining whether to transmit the signal to the user, before transmitting the signal to the user, based on a collision probability of when the selected resources are used for transmitting the signal to the user so that a probability of actually transmitting the signal to the user using the selected resources does not exceed a first threshold and a second threshold obtained based on a probability that the resources are selected, wherein the collision probability is a probability of collision between the target satellite network and an adjacent existing satellite network operated independently of the target satellite network and utilizing identical resources to the target satellite network,
transmitting the signal to the user using the selected resources, when it is determined to transmit the signal to the user;
receiving information about whether the transmission of the signal is successful or not from the user via a feedback channel after a delayed time, and
updating an internal parameter of the reinforcement learning model with respect to the resources used for transmitting the signal.
|