| CPC G10L 15/22 (2013.01) [G10L 25/78 (2013.01); H04R 3/04 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |

|
1. A device, comprising:
a first ambient microphone configured to generate a first acoustic signal;
a second ambient microphone configured to generate a second acoustic signal;
a speaker configured to play an audio content signal;
a memory that stores instruction; and
a processor operatively connected to the first ambient microphone, the second ambient microphone, and the speaker, wherein the first and second ambient microphones and the speaker and the memory and the processor are part of a single device, wherein the processor executes the instructions to perform operations, the operations comprising:
generating a first spectral characteristics of the first acoustic signal;
generating a second spectral characteristics of the second acoustic signal;
selecting a first portion of the first spectral characteristics;
selecting a second portion of the second spectral characteristics;
detecting a user's voice based on an analysis of the first acoustic signal and the second acoustic signal wherein the analysis is performed by comparing the first portion to the second portion;
generating a modified audio signal by reducing the volume of the audio content signal if the user's voice has been detected;
mixing the first acoustic signal or a modified first acoustic signal, with the audio content signal or the modified audio signal, to generate a mixed audio content signal; and
sending the mixed audio content signal to the speaker.
|