CPC G10L 15/22 (2013.01) [G06Q 30/0621 (2013.01); G06Q 30/0635 (2013.01); G10L 13/00 (2013.01); G10L 15/1815 (2013.01); G10L 15/30 (2013.01); G10L 15/1807 (2013.01); G10L 2015/223 (2013.01)] | 18 Claims |
1. A computer-implemented method comprising:
receiving, from a first device, first input audio data corresponding to a first utterance;
performing speech processing using the first input audio data to determine first intent data;
determining first data is needed to execute a first action corresponding to the first intent data;
determining second data corresponding to a request for the first data;
sending the second data to the first device;
performing processing with regard to the first intent data to determine first output data;
storing the first output data;
after storing the first output data, receiving, from a second device, second input audio data corresponding to a second utterance;
performing speech processing using the second input audio data and the first output data to determine second intent data;
performing processing with regard to the second intent data to determine second output data;
performing speech synthesis using the second output data to determine output audio data responsive to the second utterance; and
sending the output audio data to the second device for output.
|