CPC G10L 15/22 (2013.01) [G06F 3/013 (2013.01); G06F 3/167 (2013.01); G10L 15/26 (2013.01); H04W 4/025 (2013.01); G06F 2203/0381 (2013.01); G10L 15/1815 (2013.01); G10L 15/1822 (2013.01); G10L 17/00 (2013.01); G10L 2015/223 (2013.01); G10L 2015/227 (2013.01); G10L 2015/228 (2013.01)] | 75 Claims |
51. An electronic device, comprising:
one or more processors;
memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for:
receiving a first spoken input, wherein the first spoken input requests performance of a first task;
activating a virtual assistant operating on the electronic device;
in accordance with activating the virtual assistant, performing, by the activated virtual assistant, the first task based on the first spoken input;
providing, at a first time, a first response indicating the performance of the first task, wherein providing the first response includes providing at least one of audio output and displayed output;
within a predetermined duration from the first time:
monitoring received audio input to identify a second spoken input in the audio input, wherein the second spoken input does not comprise a spoken trigger for activating the virtual assistant;
in accordance with identifying the second spoken input, determining whether to respond to the second spoken input based on a direction of a user's gaze when the second spoken input was received;
in accordance with a determination to respond to the second spoken input, performing a second task based on the second spoken input; and
providing a second response indicating the performance of the second task.
|
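The control flow recited in claim 51 can be illustrated with a minimal sketch. This is not the claimed implementation. It assumes a hypothetical `FollowUpListener` class, an 8-second value for the "predetermined duration", and a gaze signal reduced to a single angle compared against a 15-degree threshold. None of these names or values appear in the claim.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Assumed values: the claim only recites "a predetermined duration" and
# "a direction of a user's gaze" without specifying either.
FOLLOW_UP_WINDOW_S = 8.0
GAZE_THRESHOLD_DEG = 15.0


@dataclass
class SpokenInput:
    text: str
    received_at: float      # seconds, same clock as the first response time
    gaze_offset_deg: float  # hypothetical: angle between gaze and device at receipt


class FollowUpListener:
    """Illustrative model of the first-task / follow-up flow (hypothetical)."""

    def __init__(self, perform_task: Callable[[str], str]) -> None:
        self.perform_task = perform_task
        self.first_response_time: Optional[float] = None

    def handle_first_input(self, text: str, now: float) -> str:
        # Activate the assistant, perform the first task, and record the
        # "first time" at which the first response is provided.
        response = self.perform_task(text)
        self.first_response_time = now
        return response

    def handle_follow_up(self, spoken: SpokenInput) -> Optional[str]:
        # A follow-up carries no spoken trigger, so it is only considered
        # inside the window that opens at the first response.
        if self.first_response_time is None:
            return None
        elapsed = spoken.received_at - self.first_response_time
        if not (0.0 <= elapsed <= FOLLOW_UP_WINDOW_S):
            return None
        # Decide whether to respond based on gaze direction at receipt.
        if spoken.gaze_offset_deg > GAZE_THRESHOLD_DEG:
            return None
        # Perform the second task and return the second response.
        return self.perform_task(spoken.text)
```

In this sketch, a follow-up received 3 seconds after the first response with the user looking at the device is answered, while the same utterance with the gaze directed elsewhere, or received after the window closes, is ignored.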