| CPC H04W 52/346 (2013.01) [H04W 52/241 (2013.01); H04W 52/242 (2013.01)] | 18 Claims |

|
1. A method of controlling transmission power for wireless communication, the method comprising:
obtaining detected transmission power;
generating a state variable and a reward variable based on the detected transmission power, a threshold transmission power, and a channel state; and
training a reinforced learning agent based on the state variable and the reward variable to output an action variable representing the transmission power,
wherein the training of the reinforced learning agent comprises generating, by the reinforced learning agent, the action variable based on the state variable and the reward variable, and
wherein the generating of the action variable comprises randomly generating the action variable with a probability ε, and greedily generating the action variable with a probability (1−ε).
|