US 12,407,771 B2
Method and device for audio input routing
Kazuhiro Ondo, Buffalo, IL (US); Michael P. Labowicz, Palatine, IL (US); Hideki Yoshino, Lake Zurich, IL (US); and Andrew K. Wells, Lindenhurst, IL (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by GOOGLE TECHNOLOGY HOLDINGS LLC, Mountain View, CA (US)
Filed on Dec. 12, 2023, as Appl. No. 18/537,386.
Application 18/537,386 is a continuation of application No. 17/833,446, filed on Jun. 6, 2022, granted, now 11,876,922.
Application 17/833,446 is a continuation of application No. 16/703,003, filed on Dec. 4, 2019, granted, now 11,363,128, issued on Jun. 14, 2022.
Application 16/703,003 is a continuation of application No. 14/142,190, filed on Dec. 27, 2013, abandoned.
Claims priority of provisional application 61/889,938, filed on Oct. 11, 2013.
Claims priority of provisional application 61/857,696, filed on Jul. 23, 2013.
Prior Publication US 2024/0137435 A1, Apr. 25, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 17/00 (2019.01); G06F 16/60 (2019.01); G10L 15/20 (2006.01); G10L 15/28 (2013.01); H04M 1/27 (2006.01); H04M 1/60 (2006.01)
CPC H04M 1/6041 (2013.01) [G06F 16/60 (2019.01); G10L 15/20 (2013.01); G10L 15/28 (2013.01); H04M 1/271 (2013.01); H04M 2250/74 (2013.01)] 17 Claims
OG exemplary drawing
 
1. A method implemented by one or more processors of a device, the method comprising:
monitoring an audio input for an occurrence of a trigger phrase that is spoken by a user of the device while the device is in a low power mode; and
in response to detecting the occurrence of the trigger phrase:
waking the device from the low power mode; and
buffering the audio input while the device wakes up from the low power mode; and
in response to the device waking up from the low power mode, processing the buffered audio input to detect an occurrence of a command phrase that is spoken by the user in a portion of the buffered audio input;
in response to detecting the occurrence of the command phrase that is spoken by the user in the portion of the buffered audio input, determining whether the command phrase spoken by the user of the device is directed to the device or is located in the middle of a conversation, wherein determining whether the command phrase spoken by the user of the device is directed to the device or is located in the middle of a conversation comprises:
determining whether a length of the buffered audio input satisfies a threshold length;
determining, based on the length of the buffered audio input satisfying the threshold length, that the command phrase spoken by the user is located in the middle of the conversation; and
in response to determining the command phrase spoken by the user is located in the middle of the conversation and based on the length of the buffered audio input satisfying the threshold length, invalidating the command phrase.