| CPC H04M 1/6041 (2013.01) [G06F 16/60 (2019.01); G10L 15/20 (2013.01); G10L 15/28 (2013.01); H04M 1/271 (2013.01); H04M 2250/74 (2013.01)] | 17 Claims |

|
1. A method implemented by one or more processors of a device, the method comprising:
monitoring an audio input for an occurrence of a trigger phrase that is spoken by a user of the device while the device is in a low power mode; and
in response to detecting the occurrence of the trigger phrase:
waking the device from the low power mode; and
buffering the audio input while the device wakes up from the low power mode; and
in response to the device waking up from the low power mode, processing the buffered audio input to detect an occurrence of a command phrase that is spoken by the user in a portion of the buffered audio input;
in response to detecting the occurrence of the command phrase that is spoken by the user in the portion of the buffered audio input, determining whether the command phrase spoken by the user of the device is directed to the device or is located in the middle of a conversation, wherein determining whether the command phrase spoken by the user of the device is directed to the device or is located in the middle of a conversation comprises:
determining whether a length of the buffered audio input satisfies a threshold length;
determining, based on the length of the buffered audio input satisfying the threshold length, that the command phrase spoken by the user is located in the middle of the conversation; and
in response to determining the command phrase spoken by the user is located in the middle of the conversation and based on the length of the buffered audio input satisfying the threshold length, invalidating the command phrase.
|