US 11,942,097 B2
Multichannel audio encode and decode using directional metadata
David McGrath, Rose Bay (AU)
Assigned to Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
Appl. No. 17/771,877
Filed by Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
PCT Filed Oct. 29, 2020, PCT No. PCT/US2020/057885
§ 371(c)(1), (2) Date Apr. 26, 2022,
PCT Pub. No. WO2021/087063, PCT Pub. Date May 6, 2021.
Claims priority of provisional application 63/086,465, filed on Oct. 1, 2020.
Claims priority of provisional application 62/927,790, filed on Oct. 30, 2019.
Prior Publication US 2022/0392462 A1, Dec. 8, 2022
Int. Cl. G10L 19/008 (2013.01); G10L 19/02 (2013.01)
CPC G10L 19/008 (2013.01)
20 Claims
 
1. A method of processing a spatial audio signal for generating a compressed representation of the spatial audio signal, the method comprising:
analyzing the spatial audio signal to determine directions of arrival for one or more audio elements in an audio scene represented by the spatial audio signal;
for at least one frequency subband of the spatial audio signal, determining respective indications of signal power associated with the determined directions of arrival;
generating metadata comprising direction information and energy information, with the direction information comprising indications of the determined directions of arrival of the one or more audio elements and the energy information comprising respective indications of signal power associated with the determined directions of arrival;
generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and
outputting, as the compressed representation of the spatial audio signal, the channel-based audio signal and the metadata.
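
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation; it rests on assumptions that the claim does not state: the spatial audio signal is first-order Ambisonics (channels W, X, Y, Z), the whole band is treated as a single frequency subband, directions of arrival are estimated from a time-averaged intensity vector, and the predefined number of channels is two. The names encode_foa, FrameMetadata, and frame_len are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class FrameMetadata:
    azimuth: float    # direction information, radians
    elevation: float  # direction information, radians
    power: float      # energy information, mean-square of W over the frame


def _mean_product(a, b):
    """Average of the element-wise product of two equal-length frames."""
    return sum(p * q for p, q in zip(a, b)) / len(a)


def encode_foa(w, x, y, z, frame_len=256):
    """Hypothetical encoder: FOA samples in, (stereo downmix, metadata) out.

    Assumption: one broadband "subband" and one dominant direction per frame.
    """
    metadata = []
    for start in range(0, len(w), frame_len):
        sl = slice(start, start + frame_len)
        fw, fx, fy, fz = w[sl], x[sl], y[sl], z[sl]

        # Analyze: the averaged intensity vector points toward the source.
        ix = _mean_product(fw, fx)
        iy = _mean_product(fw, fy)
        iz = _mean_product(fw, fz)
        azimuth = math.atan2(iy, ix)
        elevation = math.atan2(iz, math.hypot(ix, iy))

        # Indication of signal power associated with that direction.
        power = _mean_product(fw, fw)

        metadata.append(FrameMetadata(azimuth, elevation, power))

    # Channel-based signal with a predefined number of channels (assumed: 2).
    left = [0.5 * (a + b) for a, b in zip(w, y)]
    right = [0.5 * (a - b) for a, b in zip(w, y)]

    # Compressed representation: the downmix together with the metadata.
    return (left, right), metadata
```

Under these assumptions, a single plane-wave source encoded at 60 degrees azimuth and zero elevation yields metadata frames whose azimuth is 60 degrees and whose power equals the mean-square value of the source. A practical encoder would more likely run the analysis per subband in a filterbank or transform domain, which this sketch deliberately omits.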