CPC G10L 21/0216 (2013.01) [H04R 1/1083 (2013.01); G10K 2210/1081 (2013.01); H04R 2460/01 (2013.01)] | 20 Claims |
1. A method of real-time noise reduction for audio signals to enhance, with low latency, voice content relative to non-voice content of the audio signals, comprising:
receiving a time-resolved signal indicative of audio;
generating time-resolved spectral data using temporally localized spectral representations of the time-resolved signal;
determining detection of voice by comparing first filtered data and second filtered data, the first filtered data formed by attenuating temporal variations of the time-resolved spectral data based on a first timescale, the second filtered data formed by attenuating temporal variations of the time-resolved spectral data based on a second timescale different than the first timescale, including
evaluating a frequency-weighted average of distances between the first filtered data and the second filtered data, the distances being associated with corresponding spectral components represented in the time-resolved spectral data, and
comparing the frequency-weighted average to a predetermined detection threshold; and
generating a time-resolved output indicative of noise-reduced audio by processing the time-resolved signal to attenuate the non-voice content relative to the voice content based on determined detection of voice.
|