US 12,439,220 B2
Apparatus, methods and computer programs for enabling reproduction of spatial audio signals
Juha Vilkamo, Helsinki (FI); and Mikko-Ville Laitinen, Espoo (FI)
Assigned to NOKIA TECHNOLOGIES OY, Espoo (FI)
Appl. No. 17/908,969
Filed by NOKIA TECHNOLOGIES OY, Espoo (FI)
PCT Filed Feb. 23, 2021, PCT No. PCT/FI2021/050130
§ 371(c)(1), (2) Date Sep. 2, 2022,
PCT Pub. No. WO2021/176135, PCT Pub. Date Sep. 10, 2021.
Claims priority of application No. 2003063 (GB), filed on Mar. 3, 2020.
Prior Publication US 2023/0096873 A1, Mar. 30, 2023
Int. Cl. H04S 7/00 (2006.01); H04R 3/14 (2006.01); H04R 5/02 (2006.01); H04S 3/00 (2006.01)
CPC H04S 7/302 (2013.01) [H04R 3/14 (2013.01); H04R 5/02 (2013.01); H04S 3/008 (2013.01); H04S 2400/01 (2013.01); H04S 2400/11 (2013.01); H04S 2400/13 (2013.01); H04S 2400/15 (2013.01); H04S 2420/07 (2013.01)] 20 Claims
OG exemplary drawing
 
1. An apparatus comprising:
at least one processor; and
at least one memory including computer program code; the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to:
obtain audio signals comprising one or more channels;
obtain spatial metadata associated with the audio signals, wherein the spatial metadata comprises information that indicates how to spatially reproduce the audio signals;
obtain information relating to a field of view of video, wherein the video is for display on a display of a rendering device and wherein the video is associated with the audio signals;
generate spatially aligned audio signals by aligning a spatial reproduction of the audio signals with objects in the video based on the obtained spatial metadata and the obtained information relating to the field of view of the video, wherein, prior to the aligning, the spatial reproduction of the audio signals and the objects in the video are misaligned, and wherein generating the spatially aligned audio signals comprises modifying the spatial metadata by adjusting one or more parameters within the spatial metadata; and
reproducing the spatially aligned audio signals from two or more loudspeakers based on the aligning.