| CPC G10L 15/22 (2013.01) [G06N 3/08 (2013.01); G10L 13/00 (2013.01); G10L 15/04 (2013.01); G10L 15/16 (2013.01); G10L 15/24 (2013.01); G10L 15/30 (2013.01); G10L 25/90 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |

|
1. A computer-implemented method applied to a computing device for recognizing a command within an audio signal, the method comprising:
identifying, by a processor, a speaker from the audio signal;
generating a speech spectrogram of the audio signal;
separating, using a computer-based machine learning model, the audio signal to obtain a segment of the audio signal containing a speech associated with the identified speaker based on the speech spectrogram of the audio signal and an audio profile associated with the identified speaker;
determining, from the segment of the audio signal, the command from the speech associated with the identified speaker;
determining, from the segment of the audio signal, a score reflecting a value of confidence that the command determined is a valid command; and
in response to determining that the score is above a first threshold value, causing the command to be executed at the computing device.
|