US 12,119,927 B2
	Link adaptation optimization with contextual bandits
Tor Kvernvik, Täby (SE); Henrik Nyberg, Stockholm (SE); Christian Skärby, Stockholm (SE); and Raimundas Gaigalas, Hässelby (SE)
Assigned to TELEFONAKTIEBOLAGET LM ERICSSON (PUBL), Stockholm (SE)
Appl. No. 17/440,030
Filed by Telefonaktiebolaget LM Ericsson (publ), Stockholm (SE)
PCT Filed Mar. 18, 2019, PCT No. PCT/SE2019/050239 § 371(c)(1), (2) Date Sep. 16, 2021, PCT Pub. No. WO2020/190182, PCT Pub. Date Sep. 24, 2020.
Prior Publication US 2022/0182175 A1, Jun. 9, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. H04L 1/00 (2006.01); H04W 24/10 (2009.01)

CPC H04L 1/0026 (2013.01) [H04L 1/0034 (2013.01); H04W 24/10 (2013.01)]

16 Claims

1. A method for dynamically selecting a link adaptation policy (LAP), the method comprising:

a first transmission point (TRP) transmitting first data to a user equipment (UE) using a first LAP, wherein the first TRP serves at least a first cell;

receiving a channel quality report transmitted by the UE, the channel quality report comprising channel quality information indicating a quality of a channel between the UE and the first TRP;

obtaining additional information, wherein the additional information comprises: neighbor cell information about a second cell served by a second TRP, distance information indicating a distance between the UE and the first TRP, and/or gain information indicating a radio propagation gain between the UE and the serving node;

using the channel quality information, the additional information, and a machine learning (ML) model to select a LAP from a set of predefined LAPs, the set of predefined LAPs comprising the first LAP and a second LAP; and

the first TRP transmitting second data to the UE using the selected LAP, wherein

selecting the LAP from the set of predefined LAPs comprises:

determining a first reward associated with the first LAP; and

determining a second reward associated with the second LAP.