| CPC G06F 3/0484 (2013.01) [G06F 3/167 (2013.01); G10L 15/08 (2013.01); G10L 15/22 (2013.01); G10L 2015/088 (2013.01)] | 15 Claims |

|
1. A system comprising:
at least one computer processor; and
one or more computer storage media storing computer-useable instructions that, when used by the at least one computer processor, cause the at least one computer processor to perform operations comprising:
detecting a first user action of a first user;
subsequent to the detecting, capturing audio data comprising a voice utterance of the first user;
receiving, via a user interface, a manual user input of the first user;
receiving an indication that the manual user input of the first user was performed by the first user later in time than the voice utterance of the first user; and
based at least in part on the indication that the manual user input of the first user was performed by the first user later in time than the voice utterance of the first user, responding to only the manual user input and refraining from responding to the voice utterance.
|
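
The arbitration step recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. The names `InputEvent` and `select_inputs_to_respond_to`, the use of numeric timestamps as the "indication" of ordering, and the fallback behavior when the manual input is not later than the utterance are all assumptions made for illustration; the claim itself only covers the case where the manual input comes later.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class InputEvent:
    """Hypothetical record of one user input (not defined in the claim)."""
    kind: str         # "voice" or "manual"
    payload: str      # e.g. a transcript or a UI command
    timestamp: float  # seconds on any shared clock (assumed form of the "indication")


def select_inputs_to_respond_to(
    voice: Optional[InputEvent], manual: Optional[InputEvent]
) -> List[InputEvent]:
    """Pick which inputs to act on.

    If the manual input was performed later in time than the voice
    utterance, respond only to the manual input and refrain from
    responding to the utterance. Any other case is outside the claim;
    here (an assumption) every input that is present is passed through.
    """
    if voice is not None and manual is not None:
        if manual.timestamp > voice.timestamp:
            return [manual]
    return [event for event in (voice, manual) if event is not None]


# Example: the utterance is captured after a first user action, then
# the same user performs a manual input a moment later.
voice = InputEvent("voice", "delete the last paragraph", timestamp=10.0)
manual = InputEvent("manual", "click: undo", timestamp=10.4)
chosen = select_inputs_to_respond_to(voice, manual)
```

In this example `chosen` contains only the manual input, which mirrors the final limitation of the claim: the later manual input supersedes the earlier utterance.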