CPC G10L 15/22 (2013.01) [G06N 3/08 (2013.01); G10L 13/00 (2013.01); G10L 15/04 (2013.01); G10L 15/16 (2013.01); G10L 15/24 (2013.01); G10L 15/30 (2013.01); G10L 25/90 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |
1. A computer-implemented method applied to a computing device for recognizing speech within a received audio signal, the method comprising:
separating, using a computer-based neural network model, a speaker's speech from an audio signal based on the speaker's audio profile, the audio signal including the speaker's speech and audio signals associated with speech from other speakers or environmental audio signals;
determining a command from the speech;
determining, from the audio signal, a first score reflecting a percentage of confidence in determining the command based on a frequency of using the command by the speaker;
determining, from the audio signal, a second score reflecting a percentage of importance of the command;
determining whether the first score is above a first threshold value and the second score is below a second threshold value; and
causing the command to be executed at the computing device in response to determining that the first score is above the first threshold value and the second score is below the second threshold value.
|
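The final two steps of claim 1 amount to a two-threshold gate: the command is executed only when the first (confidence) score is above its threshold and the second (importance) score is below its threshold. The following is a minimal sketch of that gate, not the patented implementation. The names `CommandScores`, `confidence_from_history`, and `should_execute`, the default threshold values, and the choice to derive the confidence score as the share of the speaker's past commands that match the recognized command are all assumptions made for illustration; the claim does not specify how either score is computed beyond what it states.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class CommandScores:
    """Hypothetical container for the two scores named in claim 1."""
    confidence: float  # first score, as a percentage (0-100)
    importance: float  # second score, as a percentage (0-100)


def confidence_from_history(command: str, history: Sequence[str]) -> float:
    """Assumed scoring rule: percentage of the speaker's past commands
    that equal the recognized command. The claim only says the score is
    based on how frequently the speaker uses the command."""
    if not history:
        return 0.0
    return 100.0 * Counter(history)[command] / len(history)


def should_execute(
    scores: CommandScores,
    confidence_threshold: float = 80.0,  # assumed value, not from the claim
    importance_threshold: float = 50.0,  # assumed value, not from the claim
) -> bool:
    """Return True only if the first score is above the first threshold
    AND the second score is below the second threshold."""
    return (
        scores.confidence > confidence_threshold
        and scores.importance < importance_threshold
    )


if __name__ == "__main__":
    history = ["lights on", "lights on", "play music", "lights on"]
    confidence = confidence_from_history("lights on", history)
    scores = CommandScores(confidence=confidence, importance=20.0)
    # A lower assumed confidence threshold is passed here for the demo.
    print(should_execute(scores, confidence_threshold=70.0))
```

Under this reading, a frequently used, low-importance command runs automatically, while a command that is either rarely used or high in importance does not pass the gate. What happens to a command that fails the gate is not covered by claim 1 and is left out of the sketch.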