US 12,249,321 B2
Utilizing pre-event and post-event input streams to engage an automated assistant
Matthew Sharifi, Kilchberg (CH); Tom Hume, San Francisco, CA (US); Mohamad Hassan Mohamad Rom, Zurich (CH); Jan Althaus, Zurich (CH); and Diego Melendo Casado, Mountain View, CA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Jul. 7, 2022, as Appl. No. 17/859,068.
Application 17/859,068 is a continuation of application No. 16/349,871, granted, now 11,423,885, previously published as PCT/US2019/018750, filed on Feb. 20, 2019.
Prior Publication US 2022/0335930 A1, Oct. 20, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 15/22 (2006.01); G10L 15/18 (2013.01); G10L 15/08 (2006.01)
CPC G10L 15/1815 (2013.01) [G10L 15/22 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)] 20 Claims
OG exemplary drawing
 
10. A system comprising one or more processors and memory storing instructions that, in response to execution by the one or more processors, cause the one or more processors to:
execute an automated assistant in an inactive listening state at least in part on a computing device operated by a user;
while in the inactive listening state, and prior to one or more hot words being detected that trigger transition of the automated assistant from the inactive listening state into an active listening state:
capture, at a microphone of the computing device operated by the user, and temporarily storing in a memory buffer of the computing device operated by the user, as a pre-event input stream, first audio data representing a first utterance spoken by the user;
detect the one or more hot words in a second utterance spoken by the user after the first utterance;
after the one or more hot words are detected in the second utterance:
capture, at the microphone as a post-event input stream, second audio data containing a third utterance spoken by the user after the second utterance;
determine respective measures of fitness of the pre-event and post-event input streams for triggering responsive action by the automated assistant;
based on the respective measures of fitness, select an input stream from a set of the pre-event input stream and the post-event input stream for fulfillment by the automated assistant; and
cause the automated assistant to perform one or more responsive actions based on a command associated with the selected input stream.
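
The flow recited in claim 10 can be illustrated with a minimal sketch. It is not the patented implementation. It makes several assumptions that do not appear in the claim: transcribed text stands in for captured audio data, the hot word list and command vocabulary are hypothetical, the "measure of fitness" is a toy keyword match, and ties are resolved in favor of the post-event stream. All names (`Assistant`, `Stream`, `fitness`, `select_stream`) are illustrative only.

```python
from collections import deque
from dataclasses import dataclass

# Hypothetical values chosen for illustration; the claim does not specify them.
HOT_WORDS = ("hey assistant",)
KNOWN_COMMANDS = ("set a timer", "play music", "turn on the lights")


@dataclass
class Stream:
    label: str  # "pre-event" or "post-event"
    text: str   # transcribed text standing in for audio data (assumption)


def contains_hot_word(utterance: str) -> bool:
    lowered = utterance.lower()
    return any(hot_word in lowered for hot_word in HOT_WORDS)


def fitness(text: str) -> float:
    """Toy fitness measure: 1.0 if the text contains a known command, else 0.0."""
    lowered = text.lower()
    return 1.0 if any(cmd in lowered for cmd in KNOWN_COMMANDS) else 0.0


def select_stream(pre: Stream, post: Stream) -> Stream:
    """Pick the stream with the higher fitness; ties go to post-event (assumption)."""
    return pre if fitness(pre.text) > fitness(post.text) else post


class Assistant:
    def __init__(self, buffer_size: int = 3):
        # Bounded buffer: old utterances are dropped automatically, which
        # models the "temporarily storing" of the pre-event input stream.
        self.buffer = deque(maxlen=buffer_size)
        self.active = False  # False = inactive listening state

    def hear(self, utterance: str):
        """Process one utterance; return the selected Stream once one is chosen."""
        if not self.active:
            if contains_hot_word(utterance):
                self.active = True  # transition to the active listening state
            else:
                self.buffer.append(utterance)  # pre-event input stream
            return None
        # First utterance after the hot word is the post-event input stream.
        pre = Stream("pre-event", " ".join(self.buffer))
        post = Stream("post-event", utterance)
        chosen = select_stream(pre, post)
        self.buffer.clear()
        self.active = False
        return chosen


if __name__ == "__main__":
    assistant = Assistant()
    assistant.hear("I should set a timer for ten minutes")
    assistant.hear("hey assistant")
    selected = assistant.hear("um, please do that")
    print(selected.label, "->", selected.text)
```

In this sketch the command spoken before the hot word is selected because the post-event utterance contains no recognizable command. A real system would score audio or recognized intents rather than matching keywords.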