CPC G10L 15/1815 (2013.01) [G06F 3/167 (2013.01); G06F 40/30 (2020.01); G10L 15/1822 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01); G10L 2015/226 (2013.01)] | 20 Claims |
1. A method implemented by one or more processors, the method comprising:
receiving, from a first client device, a first input audio signal;
determining, based on the first input audio signal, a first plurality of candidate requests, each of the first plurality of candidate requests corresponding to a different semantic meaning of the first input audio signal;
determining an interface type of the first client device;
generating a first action data structure for a first candidate request of the first plurality of candidate requests and a second action data structure for a second candidate request of the first plurality of candidate requests, the first action data structure comprising a response to the first candidate request of the first plurality of candidate requests and the second action data structure comprising a response to the second candidate request of the first plurality of candidate requests; and
transmitting, to the first client device, the first action data structure and the second action data structure to be rendered in series by the first client device, wherein the transmitting is performed to cause rendering in series based on the interface type of the first client device.
|