US 12,243,553 B2
Combining of spatial audio parameters
Mikko-Ville Laitinen, Espoo (FI); Lasse Laaksonen, Tampere (FI); Anssi Rämö, Tampere (FI); Tapani Pihlajakuja, Kellokoski (FI); and Adriana Vasilache, Tampere (FI)
Assigned to NOKIA TECHNOLOGIES OY, Espoo (FI)
Appl. No. 17/783,735
Filed by Nokia Technologies Oy, Espoo (FI)
PCT Filed Nov. 13, 2020, PCT No. PCT/FI2020/050752
§ 371(c)(1), (2) Date Jun. 9, 2022,
PCT Pub. No. WO2021/130405, PCT Pub. Date Jul. 1, 2021.
Claims priority of application No. 1919131 (GB), filed on Dec. 23, 2019.
Prior Publication US 2023/0402053 A1, Dec. 14, 2023
Int. Cl. G10L 25/03 (2013.01)
CPC G10L 25/03 (2013.01) 16 Claims
OG exemplary drawing
 
1. An apparatus of an audio encoder comprising at least one processor and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to:
determine or receive a first spherical direction vector comprising an azimuth component and an elevation component for a time frequency tile of the one or more audio signals and a second spherical direction vector comprising an azimuth component and an elevation component for the time frequency tile of the one or more audio signals;
combine the first spherical direction vector and the second spherical direction vector to provide a combined spherical direction vector for the time frequency tile by the apparatus being caused, to:
convert the first spherical direction vector into a first cartesian vector and convert the second spherical direction vector into a second cartesian vector, wherein the first cartesian vector and second cartesian vector each comprise an x-axis component, a y-axis component and a z-axis component, wherein for each respective component the apparatus is caused to;
weight the respective component of the first cartesian vector by a first direct to total energy ratio calculated for the time frequency tile;
weight the respective component of the second cartesian vector by a second direct to total energy ratio calculated for the time frequency tile;
sum the weighted respective component of the first cartesian vector and the weighted respective component of the second cartesian vector to give a combined respective cartesian component, wherein the combined x-axis cartesian component, the combined y-axis cartesian component and the combined z-axis cartesian component form the components of a combined cartesian vector; and
convert the combined x-axis cartesian component, the combined y-axis cartesian component and the combined z-axis cartesian component into the combined spherical direction vector; and
encode at least one of the first spherical direction vector, the second spherical direction vector or the combined spherical direction vector for at least one of storage or transmission.