US 12,462,826 B2
	Adapting sibilance detection based on detecting specific sounds in an audio signal
Yuanxing Ma, Beijing (CN); Kai Li, Beijing (CN); and Qianqian Fang, Beijing (CN)
Assigned to Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
Appl. No. 17/627,116
Filed by Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
PCT Filed Jul. 16, 2020, PCT No. PCT/US2020/042400 § 371(c)(1), (2) Date Jan. 13, 2022, PCT Pub. No. WO2021/011814, PCT Pub. Date Jan. 21, 2021.
Claims priority of provisional application 62/884,320, filed on Aug. 8, 2019.
Claims priority of application No. PCT/CN2019/096399 (WO), filed on Jul. 17, 2019.
Prior Publication US 2022/0383889 A1, Dec. 1, 2022
Int. Cl. G10L 21/0216 (2013.01); G10L 25/18 (2013.01)

CPC G10L 21/0216 (2013.01) [G10L 25/18 (2013.01)]

20 Claims

1. A method comprising:

receiving an audio signal;

determining whether a current portion of the audio signal comprises speech or non-speech;

extracting from the audio signal a plurality of time-frequency features, the plurality of time-frequency features comprising one or more short-term features;

determining whether an impulsive sound is present in the extracted one or more short-term features;

in accordance with determining that non-speech is present in the current portion of the audio signal, adapting one or more thresholds of a sibilance detector to a first set of one or more threshold values, the adapting including giving a first weight to a first output value resulting from the determination of whether the impulsive sound is present;

in accordance with determining that speech is present in the current portion of the audio signal, adapting the one or more thresholds of the sibilance detector to a second set of one or more threshold values different than the first set of one or more threshold values, the adapting including giving a second weight to the first output value resulting from the determination of whether the impulsive sound is present;

detecting sibilance in the audio signal, using the sibilance detector with the one or more adapted thresholds; and

suppressing the sibilance in the audio signal by applying to the audio signal one or more gains determined in response to the sibilance in the audio signal.