US 12,236,940 B2
Techniques for improved audio processing using acoustic and language identification models
Yehoshua Dissen, Modiin (IL)
Assigned to GONG.io Ltd., Ramat Gan (IL)
Filed by GONG.io Ltd., Ramat Gan (IL)
Filed on Sep. 20, 2022, as Appl. No. 17/933,618.
Prior Publication US 2024/0119924 A1, Apr. 11, 2024
Int. Cl. G10L 15/06 (2013.01); G10L 15/02 (2006.01); G10L 15/065 (2013.01)
CPC G10L 15/063 (2013.01) [G10L 15/02 (2013.01); G10L 15/065 (2013.01)] 19 Claims
OG exemplary drawing
 
1. A method for audio processing, comprising:
tuning hyperparameters of an acoustic model based on outputs of a language identification (LID) model for a training audio data set and outputs of the acoustic model for the training audio data set;
applying the LID model to a first set of features extracted from a processing audio data set in order to produce outputs of the LID model for the processing audio data set; and
applying the acoustic model to a second set of features extracted from the processing audio data set and the outputs of the LID model in order to produce outputs of the acoustic model for the processing audio data set.