US 12,014,743 B2
	Spatial audio parameter merging
Lasse Laaksonen, Tampere (FI); Anssi Ramo, Tampere (FI); Mikko-Ville Laitinen, Espoo (FI); and Tapani Pihlajakuja, Vantaa (FI)
Assigned to Nokia Technogies Oy, Espoo (FI)
Appl. No. 17/058,699
Filed by Nokia Technologies Oy, Espoo (FI)
PCT Filed May 29, 2019, PCT No. PCT/FI2019/050413 § 371(c)(1), (2) Date Nov. 25, 2020, PCT Pub. No. WO2019/229299, PCT Pub. Date Dec. 5, 2019.
Claims priority of application No. 1808929 (GB), filed on May 31, 2018.
Prior Publication US 2021/0210104 A1, Jul. 8, 2021
Int. Cl. G06F 3/16 (2006.01); G10L 19/008 (2013.01); H04R 3/00 (2006.01); H04S 3/00 (2006.01); H04S 7/00 (2006.01)

CPC G10L 19/008 (2013.01) [G06F 3/165 (2013.01); H04R 3/005 (2013.01)]

20 Claims

1. An apparatus comprising:

at least one processor; and

at least one memory including instructions that, when executed by the at least one processor, cause the apparatus at least to perform:

determining, for at least one first audio signal of an audio signal format, multiple metadata parameters comprising at least one spatial audio parameter and at least one first audio signal energy parameter;

determining, for at least one further audio signal of a further audio signal format, multiple further metadata parameters comprising at least one further spatial audio parameter and at least one further audio signal energy parameter;

determining a value based on multiplication of the determined multiple metadata parameters;

determining a further value based on multiplication of the determined multiple further metadata parameters;

comparing the value and the further value to select at least one of: the at least one spatial audio parameter from the determined multiple metadata parameters, or the at least one further spatial audio parameter from the multiple further metadata parameters; and

generating a metadata based on the selection, the generated metadata comprising at least one of: the at least one spatial audio parameter; or the at least one further spatial audio parameter, wherein the generated metadata is configured to be associated with a combined audio signal formed based on the at least one first audio signal and the at least one further audio signal.