CPC G10L 15/22 (2013.01) [G06F 16/245 (2019.01); G06F 16/248 (2019.01); G10L 15/26 (2013.01); G10L 15/30 (2013.01); G10L 15/32 (2013.01); G10L 2015/223 (2013.01)] | 18 Claims |
1. A method implemented by one or more processors, the method comprising:
receiving a spoken query captured in audio data generated by one or more microphones of a client device, the spoken query following an assistant invocation by a user, wherein the assistant invocation is not specific to any particular one of a plurality of automated assistants;
providing an indication of the query to the plurality of automated assistants;
receiving, from a first automated assistant of the plurality of automated assistants, a first response to the query;
receiving, from a second automated assistant of the plurality of automated assistants, a second response to the query;
clustering the responses based on similarity between responses in each of the clusters;
determining, based on the clustering of the responses, whether to combine the first response and the second response;
generating, in response to determining to combine the first response and the second response, a combined response by combining a portion of one or more responses of a first cluster with a portion of one or more responses of a second cluster; and
causing, in response to receiving the spoken query, one or more actions to be performed based on the combined response.
|
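The clustering, combine-decision, and combining steps of claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation, and everything in it is an assumption made for illustration: the names `Response`, `cluster_responses`, `should_combine`, `combine`, and `answer`, the use of Jaccard token overlap as the similarity measure, the 0.5 threshold, the rule that responses are combined only when they fall into at least two clusters, and the choice of the first response in each cluster as the "portion" to combine. The claim itself does not specify any of these.

```python
import re
from dataclasses import dataclass


@dataclass
class Response:
    assistant: str  # hypothetical identifier of the responding assistant
    text: str       # textual form of that assistant's response


def tokens(text):
    """Lowercased alphanumeric word set of a response."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap of two texts (an assumed similarity measure)."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def cluster_responses(responses, threshold=0.5):
    """Greedy clustering: a response joins the first cluster whose
    representative (first member) is similar enough, else starts a new one."""
    clusters = []
    for response in responses:
        for cluster in clusters:
            if jaccard(response.text, cluster[0].text) >= threshold:
                cluster.append(response)
                break
        else:
            clusters.append([response])
    return clusters


def should_combine(clusters):
    """Assumed rule: combine only when the responses form two or more
    clusters, i.e. the assistants returned distinct information."""
    return len(clusters) >= 2


def combine(clusters):
    """Assumed combination: join the representative of the first cluster
    with the representative of the second cluster."""
    return " ".join(cluster[0].text for cluster in clusters[:2])


def answer(responses, threshold=0.5):
    """Cluster, decide whether to combine, and return the text to act on."""
    clusters = cluster_responses(responses, threshold)
    if not clusters:
        return ""
    if should_combine(clusters):
        return combine(clusters)
    return clusters[0][0].text


if __name__ == "__main__":
    replies = [
        Response("assistant_a", "It is 72 degrees and sunny today."),
        Response("assistant_b", "Today it is sunny and 72 degrees."),
        Response("assistant_c", "Rain is expected tomorrow evening."),
    ]
    print(answer(replies))
```

In this sketch the first two replies share every token and land in one cluster, while the third forms its own cluster, so the combined response draws one reply from each. A production system would more plausibly compare responses with learned embeddings and choose portions at a finer granularity than whole responses; token overlap is used here only to keep the example self-contained.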