US 11,893,308 B2
	Media playback system with concurrent voice assistance
Dayn Wilberding, Santa Barbara, CA (US); and John Tolomei, Renton, WA (US)
Assigned to Sonos, Inc., Santa Barbara, CA (US)
Filed by Sonos, Inc., Santa Barbara, CA (US)
Filed on Mar. 28, 2022, as Appl. No. 17/656,794.
Application 17/656,794 is a continuation of application No. 16/834,483, filed on Mar. 30, 2020, granted, now 11,288,039.
Application 16/834,483 is a continuation of application No. 16/672,764, filed on Nov. 4, 2019, granted, now 10,606,555, issued on Mar. 31, 2020.
Application 16/672,764 is a continuation of application No. 15/721,141, filed on Sep. 29, 2017, granted, now 10,466,962, issued on Nov. 5, 2019.
Prior Publication US 2022/0222039 A1, Jul. 14, 2022
Int. Cl. G10L 15/22 (2006.01); G10L 15/30 (2013.01); G06F 3/16 (2006.01); H04N 21/436 (2011.01); H04N 21/422 (2011.01); H04N 21/439 (2011.01); G06F 3/04817 (2022.01); G10L 15/08 (2006.01); H04L 12/28 (2006.01); G10L 17/22 (2013.01); G06F 9/451 (2018.01); G06F 3/0488 (2022.01)

CPC G06F 3/167 (2013.01) [G06F 3/04817 (2013.01); G10L 15/08 (2013.01); G10L 15/22 (2013.01); H04L 12/282 (2013.01); H04N 21/42203 (2013.01); H04N 21/4394 (2013.01); H04N 21/43615 (2013.01); G06F 3/0488 (2013.01); G06F 9/453 (2018.02); G10L 17/22 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)]

20 Claims

1. A playback device comprising:

at least one audio transducer;

at least one microphone;

a network interface;

at least one processor; and

data storage including instructions that are executable by the at least one processor such that the playback device is configured to:

capture a voice input via the at least one microphone, wherein capturing the voice input comprises buffering first voice data;

determine, via a first voice assistant that is capable of processing a set of first keywords, that additional voice input is required to process the voice input, wherein the voice input comprises at least one first keyword;

output, via the at least one audio transducer; an audio prompt for the additional voice input;

capture the additional voice input via the at least one microphone, wherein capturing the additional voice input comprises buffering second voice data;

process, via the first voice assistant, the voice input in combination with the additional voice input;

determine that the first voice assistant is unable to process the voice input with the additional voice input, wherein the additional voice input includes at least one second keyword, and wherein the set of first keywords is exclusive of the at least one second keyword; and

send, via the network interface to a second voice assistant that is capable of processing a set of second keywords, a query corresponding to the voice input for processing, wherein the set of second keywords includes more keywords than the set of first keywords.