| CPC G10L 15/34 (2013.01) [G06F 3/167 (2013.01); G10L 15/22 (2013.01); G10L 15/30 (2013.01); G10L 15/01 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |

|
1. A method, comprising:
receiving, by a device, audio data;
sending, by the device and based at least in part on the receiving the audio data, event data representing an event to a remote speech processing system;
performing, by a local speech processing component of the device, speech processing on the audio data to generate intent data indicating an intent associated with the audio data;
determining, based at least in part on confidence data associated with the intent, to wait an amount of time for directive data from the remote speech processing system;
receiving the directive data from the remote speech processing system prior to expiration of the amount of time;
determining, by the device, that the device is to respond to the audio data with the directive data; and
at least one of suspending or terminating, by the device, execution of the local speech processing component based at least in part on the determining that the device is to respond to the audio data with the directive data.
|
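
The steps of claim 1 can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Intent` and `LocalSpeechProcessor` classes, the `wait_budget` policy with its thresholds and durations, the `send_event` callback, and the use of a queue to stand in for the connection to the remote speech processing system. In particular, the claim only says that the wait is determined based at least in part on confidence data; the rule that higher local confidence means a shorter wait is an assumption made for illustration.

```python
import queue
from dataclasses import dataclass


@dataclass
class Intent:
    """Intent data produced by local speech processing (hypothetical shape)."""
    name: str
    confidence: float


def wait_budget(confidence, threshold=0.8, short_wait=0.05, long_wait=0.5):
    """Assumed policy: wait less for the remote system when local confidence is high."""
    return short_wait if confidence >= threshold else long_wait


class LocalSpeechProcessor:
    """Stand-in for the device's local speech processing component."""

    def __init__(self):
        self.running = True

    def process(self, audio):
        # A real component would run ASR/NLU here; this returns a fixed intent.
        return Intent(name="play_music", confidence=0.4)

    def terminate(self):
        # Corresponds to "suspending or terminating" local execution.
        self.running = False


def handle_audio(audio, local, remote_directives, send_event):
    """Walk through the claimed steps for one utterance."""
    # Send event data to the remote system based on receiving the audio.
    send_event({"type": "recognize", "num_bytes": len(audio)})

    # Perform local speech processing to generate intent data.
    intent = local.process(audio)

    # Decide how long to wait for directive data, based on confidence.
    timeout = wait_budget(intent.confidence)

    try:
        # Wait for directive data until the amount of time expires.
        directive = remote_directives.get(timeout=timeout)
    except queue.Empty:
        # Outside claim 1: no directive arrived in time, so fall back to local.
        return ("local", intent.name)

    # The device responds with the directive and stops local processing.
    local.terminate()
    return ("remote", directive)
```

Calling `handle_audio` with a queue that already holds a directive returns the remote result and leaves `local.running` set to `False`; with an empty queue the call blocks for the wait budget and then returns the local intent.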