| CPC G10L 21/0272 (2013.01) | 19 Claims |

|
1. A method comprising:
transforming, using one or more processors, one or more frames of a two-channel time domain audio signal into a time-frequency domain representation including a plurality of time-frequency tiles, wherein the frequency domain of the time-frequency domain representation includes a plurality of frequency bins grouped into a plurality of subbands;
for each time-frequency tile:
calculating, using the one or more processors, spatial parameters and a level for the time-frequency tile;
modifying, using the one or more processors, the spatial parameters using shift and squeeze parameters;
obtaining, using the one or more processors, a softmask value for each frequency bin using the modified spatial parameters, the level and subband information; and
applying, using the one or more processors, the softmask values to the time-frequency tile to generate a modified time-frequency tile of an estimated audio source.
|
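The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation, and the claim does not define the spatial parameters, the form of the shift and squeeze modification, or the softmask function. The sketch assumes the following: a tile is one frame restricted to one subband; the spatial parameter is a panning index in [-1, 1] computed from per-channel energy; the level is the total tile energy; shift is subtracted and squeeze is multiplied; the softmask is a Gaussian-shaped weight gated by the level. All function and parameter names (`dft`, `tile_softmask`, `separate_frame`, `level_floor`) are hypothetical.

```python
import cmath
import math


def dft(frame):
    """Naive DFT of one real-valued frame; returns the non-negative-frequency bins."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n // 2 + 1)
    ]


def tile_softmask(left_bins, right_bins, band, shift, squeeze, level_floor=1e-6):
    """Softmask values for the bins of one tile (one frame, one subband).

    Assumed definitions, not taken from the claim:
      level   = total energy of both channels inside the subband
      pan     = (right energy - left energy) / level, in [-1, 1]
      pan_mod = (pan - shift) * squeeze
      mask    = exp(-pan_mod**2) * level / (level + level_floor)
    """
    lo, hi = band  # subband information: half-open bin range [lo, hi)
    energy_left = sum(abs(left_bins[k]) ** 2 for k in range(lo, hi))
    energy_right = sum(abs(right_bins[k]) ** 2 for k in range(lo, hi))
    level = energy_left + energy_right
    pan = (energy_right - energy_left) / level if level > 0 else 0.0

    # Modify the spatial parameter: shift re-centers it, squeeze narrows the pass region.
    pan_mod = (pan - shift) * squeeze

    weight = math.exp(-pan_mod ** 2)      # close to 1 near the target position
    gate = level / (level + level_floor)  # suppresses near-silent tiles
    return [weight * gate for _ in range(lo, hi)]  # one value per frequency bin


def separate_frame(left, right, bands, shift=0.0, squeeze=1.0):
    """Transform one two-channel frame, mask every tile, return the masked bins."""
    left_bins, right_bins = dft(left), dft(right)
    out_left = [0j] * len(left_bins)
    out_right = [0j] * len(right_bins)
    for lo, hi in bands:
        mask = tile_softmask(left_bins, right_bins, (lo, hi), shift, squeeze)
        for i, k in enumerate(range(lo, hi)):
            out_left[k] = mask[i] * left_bins[k]
            out_right[k] = mask[i] * right_bins[k]
    return out_left, out_right


if __name__ == "__main__":
    n = 8
    tone = [math.sin(2 * math.pi * t / n) for t in range(n)]
    silence = [0.0] * n
    bands = [(0, 3), (3, 5)]  # two subbands covering the 5 non-negative bins

    center_l, _ = separate_frame(tone, tone, bands, squeeze=4.0)
    hard_left_l, _ = separate_frame(tone, silence, bands, squeeze=4.0)
    print("center-panned, bin 1 magnitude:", abs(center_l[1]))
    print("hard-left,     bin 1 magnitude:", abs(hard_left_l[1]))
```

Under these assumptions, a source panned to the center passes almost unchanged with `shift=0.0`, while a hard-left source is strongly attenuated. Setting `shift=-1.0` moves the pass region to the left channel instead. A fuller version would use windowed, overlapping frames and an inverse transform to return to the time domain.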