CPC G06F 3/167 (2013.01) [G06F 3/04817 (2013.01); G10L 15/08 (2013.01); G10L 15/22 (2013.01); H04L 12/282 (2013.01); H04N 21/42203 (2013.01); H04N 21/4394 (2013.01); H04N 21/43615 (2013.01); G06F 3/0488 (2013.01); G06F 9/453 (2018.02); G10L 17/22 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |
1. A playback device comprising:
at least one audio transducer;
at least one microphone;
a network interface;
at least one processor; and
data storage including instructions that are executable by the at least one processor such that the playback device is configured to:
capture a voice input via the at least one microphone, wherein capturing the voice input comprises buffering first voice data;
determine, via a first voice assistant that is capable of processing a set of first keywords, that additional voice input is required to process the voice input, wherein the voice input comprises at least one first keyword;
output, via the at least one audio transducer, an audio prompt for the additional voice input;
capture the additional voice input via the at least one microphone, wherein capturing the additional voice input comprises buffering second voice data;
process, via the first voice assistant, the voice input in combination with the additional voice input;
determine that the first voice assistant is unable to process the voice input with the additional voice input, wherein the additional voice input includes at least one second keyword, and wherein the set of first keywords is exclusive of the at least one second keyword; and
send, via the network interface to a second voice assistant that is capable of processing a set of second keywords, a query corresponding to the voice input for processing, wherein the set of second keywords includes more keywords than the set of first keywords.
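The control flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `FirstAssistant`, `handle_voice_input`, `capture_additional`, `play_prompt`, and `send_to_second` are hypothetical, and so are the keyword sets and the prompt text. Word lists stand in for buffered voice data, and the sketch assumes the forwarded query combines the voice input with the additional voice input, which the claim does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class FirstAssistant:
    """Hypothetical first voice assistant that knows only a small keyword set."""
    keywords: frozenset
    needs_argument: frozenset  # assumed: keywords that cannot be processed alone

    def requires_more_input(self, words: List[str]) -> bool:
        # True when the input is a lone keyword that needs an argument.
        return len(words) == 1 and words[0] in self.needs_argument

    def can_process(self, words: List[str]) -> bool:
        # The first assistant can act only if every word is one of its keywords.
        return all(w in self.keywords for w in words)


def handle_voice_input(
    voice_input: str,
    capture_additional: Callable[[], str],
    play_prompt: Callable[[str], None],
    first: FirstAssistant,
    send_to_second: Callable[[str], str],
) -> str:
    # Stand-in for "buffering first voice data".
    first_buffer = voice_input.lower().split()
    words = list(first_buffer)

    if first.requires_more_input(words):
        # Stand-in for the audio prompt output via the audio transducer.
        play_prompt("Please say more.")
        # Stand-in for "buffering second voice data".
        second_buffer = capture_additional().lower().split()
        words += second_buffer

    if first.can_process(words):
        return "first:" + " ".join(words)

    # The first assistant cannot process the combined input (it contains a
    # keyword outside its set), so a query goes to the second assistant.
    # Assumption: the query is the combined text of both buffers.
    return send_to_second(" ".join(words))


# Usage: "jazz" is outside the first keyword set, so the query is forwarded.
first = FirstAssistant(
    keywords=frozenset({"play", "pause", "next"}),
    needs_argument=frozenset({"play"}),
)
prompts: List[str] = []
result = handle_voice_input(
    "play", lambda: "jazz", prompts.append, first, lambda q: "second:" + q
)
print(result)  # second:play jazz
```

In this sketch the network interface is reduced to the `send_to_second` callable. A real device would serialize the query and transmit it to a remote service.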