US 12,260,871 B2
	Devices, systems, and methods of noise reduction
Craig Fraser, Victoria (CA); Daniel Davies, Victoria (CA); John Horstmann, San Jose, CA (US); and Lars Christensen, Sidney (CA)
Filed by Beacon Hill Innovations Ltd., Victoria (CA)
Filed on May 28, 2024, as Appl. No. 18/675,981.
Application 18/675,981 is a continuation of application No. 17/528,874, filed on Nov. 17, 2021, granted, now 12,033,650.
Prior Publication US 2024/0312473 A1, Sep. 19, 2024
Int. Cl. G10L 21/0216 (2013.01); H04R 1/10 (2006.01)

CPC G10L 21/0216 (2013.01) [H04R 1/1083 (2013.01); G10K 2210/1081 (2013.01); H04R 2460/01 (2013.01)]

20 Claims

1. A method of real-time noise reduction for audio signals to enhance, with low latency, voice content relative to non-voice content of the audio signals, comprising:

receiving a time-resolved signal indicative of audio;

generating time-resolved spectral data using temporally localized spectral representations of the time-resolved signal;

determining detection of voice by comparing first filtered data and second filtered data, the first filtered data formed by attenuating temporal variations of the time-resolved spectral data based on a first timescale, the second filtered data formed by attenuating temporal variations of the time-resolved spectral data based on a second timescale different than the first timescale; and

generating a time-resolved output indicative of noise-reduced audio by processing the time-resolved signal to attenuate the non-voice content relative to the voice content based on determined detection of voice,

wherein the first timescale is greater than the second timescale, and a spectrum of the non-voice content varies over a timescale greater than the second timescale such that a frequency-weighted sum of squared differences, over frequencies associated with voice and non-voice content, between components of a time-average of the spectrum of the non-voice content over the first timescale and components of a time-average of the spectrum of the non-voice content over the second timescale is at most 0.1% of a frequency-weighted sum of squares of components of a time-average of the spectrum of the non-voice content over the first timescale.