| CPC H04S 1/002 (2013.01) [H04R 5/04 (2013.01); H04S 3/00 (2013.01); H04S 7/303 (2013.01); H04S 3/008 (2013.01); H04S 3/02 (2013.01); H04S 2420/01 (2013.01); H04S 2420/03 (2013.01)] | 14 Claims |

|
1. A method for processing immersive audio content having one or more audio components for dialogue enhancement, wherein each audio component is associated with a spatial location, the method comprising:
obtaining a first audio signal presentation of the audio components intended for reproduction on a first audio reproduction system;
obtaining a set of presentation transform parameters configured to enable transformation of said first audio signal presentation into said second audio signal presentation intended for reproduction on a second audio reproduction system;
obtaining a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation;
transforming the first audio signal presentation to form the second audio signal presentation based on the set of presentation transform parameters;
forming an acoustic environment simulation input signal based on the first audio signal presentation;
applying an acoustic environment simulation to the acoustic environment simulation input signal to generate an acoustic environment simulation output signal;
applying the set of dialogue estimation parameters to the first audio signal presentation to form a dialogue presentation of the dialogue components; and
summing the dialogue presentation with the second audio signal presentation and the acoustic environment simulation output signal to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system;
wherein the second audio signal presentation is a binaural audio signal presentation.
|