CPC G10L 17/04 (2013.01) [G10L 15/083 (2013.01); G10L 15/22 (2013.01); G10L 17/14 (2013.01); G10L 17/18 (2013.01); G10L 17/22 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |
1. A computer-implemented method when executed on data processing hardware causes the data processing hardware to perform operations comprising:
generating an audio signal encoding an utterance captured by a microphone of a user device, the user device having a plurality of different users;
transmitting, from the user device, the audio signal encoding the utterance to a server in communication with the user device, the server configured to:
determine a speaker of the utterance from one of the plurality of different users of the user device based on a comparison between the audio signal encoding the utterance and corresponding speaker verification data stored on the server for each user of the plurality of different users of the user device; and
process the audio signal encoding the utterance using a speech recognition module to identify a particular action for the user device to execute; and
executing the particular action identified by the server, the particular action when executed causing a particular application to launch on the user device based on corresponding user permissions associated with the speaker determined by the server to access the particular action.
|
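For illustration only, and not as part of the claim language, the operations recited in claim 1 can be sketched as a small in-memory simulation. Everything named here is hypothetical: `UserProfile`, `Server`, `UserDevice`, the cosine-similarity comparison, the 0.8 threshold, and the command table are assumptions chosen to make the flow concrete. The "audio signal" is modeled as a dictionary holding a precomputed embedding and transcript, standing in for real feature extraction and speech recognition. The claim does not say where the permission check runs; placing it on the device is an assumption of this sketch.

```python
import math
from dataclasses import dataclass, field


@dataclass
class UserProfile:
    """Hypothetical per-user record held by the server."""
    name: str
    voiceprint: list  # stand-in for stored speaker verification data
    allowed_actions: set = field(default_factory=set)  # user permissions


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class Server:
    # Hypothetical mapping from recognized text to an action identifier.
    COMMANDS = {"open mail": "launch_mail", "play music": "launch_music"}

    def __init__(self, profiles, threshold=0.8):
        self.profiles = profiles  # assumed non-empty
        self.threshold = threshold

    def determine_speaker(self, audio):
        """Compare the utterance against each user's stored verification data."""
        scored = [(cosine_similarity(audio["embedding"], p.voiceprint), p)
                  for p in self.profiles]
        score, best = max(scored, key=lambda pair: pair[0])
        return best if score >= self.threshold else None

    def identify_action(self, audio):
        """Stand-in for the speech recognition module: look up the transcript."""
        return self.COMMANDS.get(audio["transcript"])

    def handle(self, audio):
        return self.determine_speaker(audio), self.identify_action(audio)


class UserDevice:
    def __init__(self, server):
        self.server = server
        self.launched = []  # records actions that resulted in an app launch

    def on_utterance(self, audio):
        """Transmit the audio to the server, then execute the action if permitted."""
        speaker, action = self.server.handle(audio)
        if speaker is None or action is None:
            return False
        if action not in speaker.allowed_actions:
            return False
        self.launched.append(action)
        return True


# Two users share one device; each has different permissions.
alice = UserProfile("alice", [1.0, 0.0], {"launch_mail"})
bob = UserProfile("bob", [0.0, 1.0], {"launch_music"})
device = UserDevice(Server([alice, bob]))

mail_ok = device.on_utterance({"embedding": [0.9, 0.1], "transcript": "open mail"})
music_ok = device.on_utterance({"embedding": [0.9, 0.1], "transcript": "play music"})
```

In this sketch both utterances match the first user's voiceprint, so the mail request launches while the music request is declined, because only the second user holds that permission.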