| CPC G10L 21/0364 (2013.01) [G10L 21/034 (2013.01); G10L 25/51 (2013.01); G10L 25/78 (2013.01)] | 15 Claims |

|
1. A method comprising:
dividing a binaural speech signal into frames;
applying a time-frequency transform to each frame;
computing features of the frames based on a time-frequency representation;
classifying, by a classifier, each frame as self speech or external speech, based at least in part on a subset of features;
computing a dissimilarity function based on a subset of features;
segmenting the signal at peaks of the dissimilarity function;
for each segment, determining a respective overall class among self speech or external speech by aggregating classifier data of the frames belonging to the segment; and
processing each segment with a speech enhancement chain whose settings are based on determined overall class for such segment.
|
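The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not specify the features, the classifier, the dissimilarity function, or the enhancement settings, so everything named below is an assumption chosen for illustration. The sketch assumes two features per frame (total log energy and interaural level difference), a fixed threshold rule standing in for a trained classifier, a windowed mean-distance dissimilarity, majority voting as the aggregation, and a hypothetical `SETTINGS` table. The constants `FRAME`, `HOP`, `energy_db`, `ild_db`, `w`, and `min_height` are likewise made up.

```python
import cmath
import math
from collections import Counter

FRAME, HOP = 64, 32  # hypothetical frame length and hop size, in samples

def split_frames(signal):
    """Divide a list of (left, right) samples into overlapping frames."""
    return [signal[i:i + FRAME] for i in range(0, len(signal) - FRAME + 1, HOP)]

def spectrum_energy(samples):
    """Energy of the Hann-windowed DFT (bins 0..N/2) of one channel of one frame."""
    n = len(samples)
    win = [s * (0.5 - 0.5 * math.cos(2 * math.pi * i / n)) for i, s in enumerate(samples)]
    bins = (sum(w * cmath.exp(-2j * math.pi * k * i / n) for i, w in enumerate(win))
            for k in range(n // 2 + 1))
    return sum(abs(b) ** 2 for b in bins) + 1e-12  # small floor avoids log(0)

def features(frame):
    """Assumed features: total log energy (dB) and interaural level difference (dB)."""
    e_left = spectrum_energy([left for left, _ in frame])
    e_right = spectrum_energy([right for _, right in frame])
    return (10 * math.log10(e_left + e_right), 10 * math.log10(e_left / e_right))

def classify(feat, energy_db=18.0, ild_db=3.0):
    """Placeholder rule standing in for a trained classifier (thresholds are made up)."""
    return "self" if feat[0] > energy_db and abs(feat[1]) < ild_db else "external"

def dissimilarity(feats, w=4):
    """Distance between the mean features of the w frames before and after each index."""
    def mean(block):
        return [sum(col) / len(block) for col in zip(*block)]
    d = [0.0] * len(feats)
    for t in range(w, len(feats) - w + 1):
        d[t] = math.dist(mean(feats[t - w:t]), mean(feats[t:t + w]))
    return d

def peaks(d, min_height=6.0):
    """Indices of local maxima above a hypothetical minimum height."""
    return [t for t in range(1, len(d) - 1)
            if d[t] > min_height and d[t] > d[t - 1] and d[t] >= d[t + 1]]

# Hypothetical enhancement settings per class; the claim does not specify any.
SETTINGS = {"self": {"gain_db": -6.0}, "external": {"gain_db": 6.0}}

def process(signal):
    """Return (first_frame, end_frame, overall_class, settings) for each segment."""
    frames = split_frames(signal)
    feats = [features(f) for f in frames]
    labels = [classify(f) for f in feats]
    bounds = [0] + peaks(dissimilarity(feats)) + [len(frames)]
    segments = []
    for start, end in zip(bounds, bounds[1:]):
        overall = Counter(labels[start:end]).most_common(1)[0][0]  # majority vote
        segments.append((start, end, overall, SETTINGS[overall]))
    return segments

# Synthetic test signal: a loud, symmetric tone followed by a quiet, left-heavy tone.
signal = [(math.sin(2 * math.pi * n / 8),) * 2 for n in range(1024)]
signal += [(0.1 * math.sin(2 * math.pi * n / 8), 0.05 * math.sin(2 * math.pi * n / 8))
           for n in range(1024, 2048)]
for segment in process(signal):
    print(segment)
```

In this sketch the classifier and the dissimilarity function both use the full two-element feature tuple; the claim allows each to use its own subset of features. A real enhancement chain would apply the selected settings to the samples of each segment, which is omitted here.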