CPC G10L 15/20 (2013.01) [G10L 15/22 (2013.01); G10L 15/28 (2013.01); G10L 21/02 (2013.01); G10L 21/0272 (2013.01); G10L 21/0364 (2013.01); G10L 25/78 (2013.01); G10L 25/84 (2013.01); G10L 21/0216 (2013.01); G10L 25/21 (2013.01); G10L 2021/02161 (2013.01); G10L 2025/783 (2013.01)] | 16 Claims |
1. A computer-implemented method when executed on data processing hardware of a computing device causes the data processing hardware to perform operations comprising:
receiving an audio signal detected by a first microphone in a group of microphones of the computing device while a second microphone in the group of microphones is powered off;
while the second microphone is powered off, determining an audio signal energy level of the audio signal detected by the first microphone has deviated from a baseline audio signal energy level by more than a threshold amount;
in response to determining that the audio signal energy level of the audio signal detected by the first microphone has deviated from the baseline audio signal energy level by more than the threshold amount:
triggering the second microphone to power on; and
triggering a voice activity detector to power on;
performing, using the voice activity detector that is powered on, voice activity detection on the audio signal to determine whether speech is detected in the audio signal detected by the first microphone;
buffering a voice signal based on audio signals detected by the first microphone and the second microphone; and
in response to determining that speech is detected in the audio signal detected by the first microphone:
estimating a signal-to-noise ratio (SNR) of the buffered voice signal; and
performing, based on the estimated SNR of the buffered voice signal, noise suppression on the buffered voice signal to provide a noise suppressed voice signal.
|