US 11,705,147 B2
Mixed adaptive and fixed coefficient neural networks for speech enhancement
Erik Visser, San Diego, CA (US); Vahid Montazeri, San Diego, CA (US); Shuhua Zhang, San Diego, CA (US); and Lae-Hoon Kim, San Diego, CA (US)
Assigned to QUALCOMM Incorporated, San Diego, CA (US)
Filed by QUALCOMM Incorporated, San Diego, CA (US)
Filed on Apr. 28, 2021, as Appl. No. 17/243,434.
Claims priority of provisional application 63/017,155, filed on Apr. 29, 2020.
Prior Publication US 2021/0343306 A1, Nov. 4, 2021
Int. Cl. G10L 21/0208 (2013.01); G10L 25/30 (2013.01); G06N 3/08 (2023.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01)
CPC G10L 21/0208 (2013.01) [G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G10L 25/30 (2013.01); G10L 2021/02082 (2013.01)] 30 Claims
OG exemplary drawing
 
1. An apparatus comprising:
memory; and
one or more processors coupled to the memory, the one or more processors being configured to:
receive, by a first neural network portion of a hybrid neural network system, an input comprising audio data and reference data, the audio data comprising speech data, noise data, and echo data;
filter, by the first neural network portion, a portion of the audio data based on adapted coefficients of the first neural network portion, the portion of the audio data comprising at least one of the noise data and/or the echo data, the adapted coefficients comprising coefficients adjusted based on the input and/or an output of the first neural network portion;
generate, by the first neural network portion based on the filtering of the portion of the audio data, filtered audio data comprising the speech data and an unfiltered portion of at least one of the noise data and/or the echo data; and
extract, by a second neural network portion of the hybrid neural network system based on the filtered audio data and the reference data, the speech data from the filtered audio data.