CPC G10L 21/0216 (2013.01) [H04R 1/1083 (2013.01); G10K 2210/1081 (2013.01); H04R 2460/01 (2013.01)] | 20 Claims |
1. A method of real-time noise reduction for audio signals to enhance, with low latency, voice content relative to non-voice content of the audio signals, comprising:
receiving a time-resolved signal indicative of audio;
generating time-resolved spectral data using temporally localized spectral representations of the time-resolved signal;
determining detection of voice by comparing first filtered data and second filtered data, the first filtered data formed by attenuating temporal variations of the time-resolved spectral data based on a first timescale, the second filtered data formed by attenuating temporal variations of the time-resolved spectral data based on a second timescale different than the first timescale; and
generating a time-resolved output indicative of noise-reduced audio by processing the time-resolved signal to attenuate the non-voice content relative to the voice content based on determined detection of voice,
wherein the first timescale is greater than the second timescale, and a spectrum of the non-voice content varies over a timescale greater than the second timescale such that a frequency-weighted sum of squared differences, over frequencies associated with voice and non-voice content, between components of a time-average of the spectrum of the non-voice content over the first timescale and components of a time-average of the spectrum of the non-voice content over the second timescale is at most 0.1% of a frequency-weighted sum of squares of components of a time-average of the spectrum of the non-voice content over the first timescale.
|
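The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: all function names, the one-pole smoothing filter, the frame sizes, the detection threshold, the per-frame gain, and the use of power spectra are assumptions chosen for illustration. Only the two-timescale comparison and the 0.1% bound come from the claim text.

```python
import numpy as np

def stft_power(x, frame_len=256, hop=128):
    """Temporally localized spectra: one power spectrum per windowed frame."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1)) ** 2

def smooth(spec, timescale_frames):
    """One-pole low-pass along time (assumed filter); attenuates variations
    faster than roughly `timescale_frames` frames."""
    alpha = 1.0 / timescale_frames
    out = np.empty_like(spec, dtype=float)
    state = spec[0].astype(float)
    for t, frame in enumerate(spec):
        state = state + alpha * (frame - state)
        out[t] = state
    return out

def detect_voice(spec, long_frames=100, short_frames=4, threshold=2.0):
    """Compare first (long-timescale) and second (short-timescale) filtered data.
    A frame is flagged when short-term energy exceeds long-term energy by
    `threshold` (a hypothetical decision rule)."""
    slow = smooth(spec, long_frames)
    fast = smooth(spec, short_frames)
    ratio = fast.sum(axis=1) / (slow.sum(axis=1) + 1e-12)
    return ratio > threshold

def reduce_noise(x, voiced, frame_len=256, hop=128, floor_gain=0.1):
    """Attenuate samples outside detected-voice frames (assumed gain scheme)."""
    gain = np.full(len(x), floor_gain)
    for i in np.flatnonzero(voiced):
        gain[i * hop : i * hop + frame_len] = 1.0
    return x * gain

def stationarity_ratio(noise_spec, long_frames, short_frames, weights=None):
    """Frequency-weighted sum of squared differences between the long and short
    time averages of the noise spectrum, divided by the frequency-weighted sum
    of squares of the long time average."""
    if weights is None:
        weights = np.ones(noise_spec.shape[1])
    long_avg = noise_spec[-long_frames:].mean(axis=0)
    short_avg = noise_spec[-short_frames:].mean(axis=0)
    num = np.sum(weights * (long_avg - short_avg) ** 2)
    den = np.sum(weights * long_avg ** 2)
    return num / den

# The "wherein" clause of the claim corresponds to requiring
# stationarity_ratio(...) <= 0.001 for the non-voice content.
```

In this sketch, a slowly varying background leaves the two smoothed spectra nearly equal, so the ratio in `detect_voice` stays near 1, while a short burst raises the short-timescale estimate well before the long-timescale one responds.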