US 11,810,562 B2
	Reducing the need for manual start/end-pointing and trigger phrases
Philippe P. Piernot, Palo Alto, CA (US); and Justin G. Binder, Oakland, CA (US)
Assigned to Apple Inc., Cupertino, CA (US)
Filed by Apple Inc., Cupertino, CA (US)
Filed on Aug. 30, 2021, as Appl. No. 17/461,018.
Application 17/461,018 is a continuation of application No. 16/800,456, filed on Feb. 25, 2020, granted, now 11,133,008.
Application 16/800,456 is a continuation of application No. 16/530,708, filed on Aug. 2, 2019, granted, now 10,770,073, issued on Sep. 8, 2020.
Application 16/530,708 is a continuation of application No. 15/656,793, filed on Jul. 21, 2017, granted, now 10,373,617, issued on Aug. 6, 2019.
Application 15/656,793 is a continuation of application No. 14/502,737, filed on Sep. 30, 2014, granted, now 9,715,875, issued on Jul. 25, 2017.
Claims priority of provisional application 62/005,760, filed on May 30, 2014.
Prior Publication US 2021/0390955 A1, Dec. 16, 2021
Int. Cl. G10L 15/22 (2006.01); G10L 15/26 (2006.01); H04W 4/02 (2018.01); G06F 3/16 (2006.01); G06F 3/01 (2006.01); G10L 15/18 (2013.01); G10L 17/00 (2013.01)

CPC G10L 15/22 (2013.01) [G06F 3/013 (2013.01); G06F 3/167 (2013.01); G10L 15/26 (2013.01); H04W 4/025 (2013.01); G06F 2203/0381 (2013.01); G10L 15/1815 (2013.01); G10L 15/1822 (2013.01); G10L 17/00 (2013.01); G10L 2015/223 (2013.01); G10L 2015/227 (2013.01); G10L 2015/228 (2013.01)]

75 Claims

51. An electronic device, comprising:

one or more processors;

memory; and

one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for:

receiving a first spoken input, wherein the first spoken input requests performance of a first task;

activating a virtual assistant operating on the electronic device;

in accordance with activating the virtual assistant, performing, by the activated virtual assistant, the first task based on the first spoken input;

providing, at a first time, a first response indicating the performance of the first task, wherein providing the first response includes providing at least one of audio output and displayed output;

within a predetermined duration from the first time:

monitoring received audio input to identify a second spoken input in the audio input, wherein the second spoken input does not comprise a spoken trigger for activating the virtual assistant;

in accordance with identifying the second spoken input, determining whether to respond to the second spoken input based on a direction of a user's gaze when the second spoken input was received;

in accordance with a determination to respond to the second spoken input, performing a second task based on the second spoken input; and

providing a second response indicating the performance of the second task.