US 12,462,826 B2
Adapting sibilance detection based on detecting specific sounds in an audio signal
Yuanxing Ma, Beijing (CN); Kai Li, Beijing (CN); and Qianqian Fang, Beijing (CN)
Assigned to Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
Appl. No. 17/627,116
Filed by Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
PCT Filed Jul. 16, 2020, PCT No. PCT/US2020/042400
§ 371(c)(1), (2) Date Jan. 13, 2022,
PCT Pub. No. WO2021/011814, PCT Pub. Date Jan. 21, 2021.
Claims priority of provisional application 62/884,320, filed on Aug. 8, 2019.
Claims priority of application No. PCT/CN2019/096399 (WO), filed on Jul. 17, 2019.
Prior Publication US 2022/0383889 A1, Dec. 1, 2022
Int. Cl. G10L 21/0216 (2013.01); G10L 25/18 (2013.01)
CPC G10L 21/0216 (2013.01) [G10L 25/18 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
receiving an audio signal;
determining whether a current portion of the audio signal comprises speech or non-speech;
extracting from the audio signal a plurality of time-frequency features, the plurality of time-frequency features comprising one or more short-term features;
determining whether an impulsive sound is present in the extracted one or more short-term features;
in accordance with determining that non-speech is present in the current portion of the audio signal, adapting one or more thresholds of a sibilance detector to a first set of one or more threshold values, the adapting including giving a first weight to a first output value resulting from the determination of whether the impulsive sound is present;
in accordance with determining that speech is present in the current portion of the audio signal, adapting the one or more thresholds of the sibilance detector to a second set of one or more threshold values different than the first set of one or more threshold values, the adapting including giving a second weight to the first output value resulting from the determination of whether the impulsive sound is present;
detecting sibilance in the audio signal, using the sibilance detector with the one or more adapted thresholds; and
suppressing the sibilance in the audio signal by applying to the audio signal one or more gains determined in response to the sibilance in the audio signal.