CPC G10L 17/26 (2013.01) [G10L 15/183 (2013.01); G10L 15/22 (2013.01); G10L 15/34 (2013.01)] | 20 Claims |
3. A computer-implemented method comprising:
receiving first audio data representing a first portion of an utterance;
performing first automatic speech recognition (ASR) processing on the first audio data using a first ASR component of a first device to generate first data representing a possible transcription of the first portion of the utterance;
sending the first data to a second device;
processing the first audio data to identify one or more characteristics of the first audio data;
sending second data representing the one or more characteristics to the second device;
performing second ASR processing on the first data using a second ASR component of the second device to determine a first ASR hypothesis corresponding to the first portion of the utterance;
performing first natural language understanding (NLU) processing on the first ASR hypothesis using an NLU component of the second device to generate first NLU results data; and
processing at least a portion of the first NLU results data using a skill component of the second device to perform a first action responsive to the first portion of the utterance.
|
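The data flow recited in the claim can be illustrated with a minimal sketch. Every name here (`FirstDevice`, `SecondDevice`, `NluResult`, `handle_first_portion`, the toy word weights, and the hard-coded candidate transcriptions) is a hypothetical stand-in chosen for illustration. The claim does not specify any of these implementations, and the sketch is not meant to suggest how the claimed components actually work.

```python
from dataclasses import dataclass


@dataclass
class NluResult:
    """Hypothetical container for the 'first NLU results data'."""
    intent: str
    slot: str


class FirstDevice:
    """Hypothetical first device: first ASR pass plus audio characteristics."""

    def first_asr(self, audio):
        # Stand-in for the first ASR component. The returned list plays the
        # role of the "first data" (possible transcriptions); it is hard-coded.
        return ["play chess", "play jazz"]

    def characteristics(self, audio):
        # Stand-in for identifying characteristics of the audio ("second data").
        peak = max((abs(s) for s in audio), default=0.0)
        return {"num_samples": len(audio), "peak_amplitude": peak}


class SecondDevice:
    """Hypothetical second device: second ASR pass, NLU, and a skill."""

    # Toy word weights standing in for whatever model the second ASR uses.
    WORD_WEIGHTS = {"jazz": 2.0, "chess": 1.0}

    def __init__(self):
        self.received = {}

    def receive(self, key, payload):
        # Models "sending ... to the second device" as a simple hand-off.
        self.received[key] = payload

    def second_asr(self):
        # Second ASR processing operates on the first data (the candidates),
        # not on raw audio, and selects a single ASR hypothesis.
        candidates = self.received["first_data"]

        def score(text):
            return sum(self.WORD_WEIGHTS.get(w, 0.0) for w in text.split())

        return max(candidates, key=score)

    def nlu(self, hypothesis):
        # Toy NLU: first word is the intent, the remainder is a slot value.
        verb, _, rest = hypothesis.partition(" ")
        return NluResult(intent=verb, slot=rest)

    def skill(self, result):
        # Toy skill: the "action" is just a string describing what to do.
        return f"{result.intent}:{result.slot}"


def handle_first_portion(audio):
    """Run the claimed steps in order for one portion of an utterance."""
    first, second = FirstDevice(), SecondDevice()
    second.receive("first_data", first.first_asr(audio))
    second.receive("second_data", first.characteristics(audio))
    hypothesis = second.second_asr()
    return second.skill(second.nlu(hypothesis))


if __name__ == "__main__":
    print(handle_first_portion([0.1, -0.5, 0.2]))
```

In this sketch the second device stores the characteristics it receives but does not use them, because the claim as recited only requires that they be sent; how they might influence later processing is left open.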