| CPC G06F 3/167 (2013.01) [G10L 15/30 (2013.01); H04L 12/2803 (2013.01); H04L 12/40052 (2013.01); H04L 67/10 (2013.01); H04L 67/561 (2022.05); G10L 2015/223 (2013.01); G10L 2015/225 (2013.01); H04L 2012/2849 (2013.01); H04L 41/0668 (2013.01); H04L 43/0817 (2013.01)] | 17 Claims |

|
1. A system comprising:
at least one processor;
at least one non-transitory computer-readable medium; and
program instructions stored on the at least one non-transitory computer-readable medium that are executable by the at least one processor such that the system is configured to:
determine that a first playback device has detected a voice input comprising a voice command via at least one microphone of the first playback device, wherein the first playback device is configured to receive voice commands for a media playback system comprising the first playback device and a second playback device;
determine (i) a first portion of a response to the voice input comprising the command and (ii) a second portion of the response to the voice input comprising the command;
cause the first playback device to perform the first portion of the response to the voice input;
determine that the first playback device is not configured to perform the second portion of the response to the voice input;
determine that the second playback device is configured to perform the second portion of the response to the voice input; and
cause the second playback device to perform the second portion of the response to the voice input by (i) determining that a fallback device is configured to perform the second portion of the response and (ii) causing the fallback device to perform the second portion of the response to the voice input.
|
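The routing recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the `PlaybackDevice` class, its capability sets, the helper `find_fallback`, and the portion labels are hypothetical names introduced only for illustration. The sketch assumes each device advertises which response portions it is configured to perform. The branch in which the first device can perform the second portion itself is outside the claim and is included only so the function is total.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class PlaybackDevice:
    """Hypothetical stand-in for a playback device in the media playback system."""
    name: str
    capabilities: Set[str]  # assumed: response portions this device is configured to perform
    performed: List[str] = field(default_factory=list)

    def can_perform(self, portion: str) -> bool:
        return portion in self.capabilities

    def perform(self, portion: str) -> None:
        # A real device would play audio or change state; here we only record the portion.
        self.performed.append(portion)


def find_fallback(portion: str, candidates: List[PlaybackDevice]) -> Optional[PlaybackDevice]:
    """Return the first candidate configured to perform the portion, or None."""
    for device in candidates:
        if device.can_perform(portion):
            return device
    return None


def dispatch_response(
    first: PlaybackDevice,
    others: List[PlaybackDevice],
    first_portion: str,
    second_portion: str,
) -> Dict[str, str]:
    """Map each response portion to the name of the device that performed it."""
    # The device that detected the voice input performs the first portion.
    first.perform(first_portion)
    assignments = {first_portion: first.name}

    if first.can_perform(second_portion):
        # Not part of the claim: no fallback is needed in this case.
        target = first
    else:
        # Claimed path: the first device is not configured for the second portion,
        # so a fallback device that is configured for it is selected.
        fallback = find_fallback(second_portion, others)
        if fallback is None:
            raise LookupError(f"no device is configured to perform {second_portion!r}")
        target = fallback

    target.perform(second_portion)
    assignments[second_portion] = target.name
    return assignments


if __name__ == "__main__":
    kitchen = PlaybackDevice("kitchen", {"acknowledge"})
    living_room = PlaybackDevice("living_room", {"acknowledge", "play_audio"})
    print(dispatch_response(kitchen, [living_room], "acknowledge", "play_audio"))
```

In this sketch the second playback device plays the role of the fallback device: it is chosen only after the first device is found not to be configured for the second portion of the response.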