US 11,862,161 B2
VAS toggle based on device orientation
Fiede Schillmoeller, Waterlands (NL); and Connor Smith, New Hudson, MI (US)
Assigned to Sonos, Inc., Santa Barbara, CA (US)
Filed by Sonos, Inc., Santa Barbara, CA (US)
Filed on Nov. 29, 2021, as Appl. No. 17/536,572.
Application 17/536,572 is a continuation of application No. 16/660,197, filed on Oct. 22, 2019, granted, now 11,189,286.
Prior Publication US 2022/0230635 A1, Jul. 21, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 15/00 (2013.01); G10L 15/22 (2006.01); G06F 3/0482 (2013.01); G06F 3/04842 (2022.01); G06F 3/16 (2006.01); G10L 15/18 (2013.01); G10L 15/26 (2006.01); G10L 17/22 (2013.01)
CPC G10L 15/22 (2013.01) [G06F 3/0482 (2013.01); G06F 3/04842 (2013.01); G06F 3/167 (2013.01); G10L 15/1815 (2013.01); G10L 15/1822 (2013.01); G10L 15/26 (2013.01); G10L 17/22 (2013.01)] 20 Claims
 
1. A network microphone device comprising:
at least one physical control;
one or more microphones;
a network interface;
at least one processor; and
a housing carrying the one or more microphones, the network interface, the at least one processor, and data storage including instructions that are executable by the at least one processor such that the network microphone device is configured to:
detect a first input to the at least one physical control;
after detection of the first input to the at least one physical control, enable a first mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the first mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) disable voice input processing via a cloud-based voice assistant service and (b) enable voice input processing via a local natural language unit;
while the first mode is enabled: (i) capture sound data associated with a first voice input via the one or more microphones and (ii) detect, via the local natural language unit, that the first voice input comprises sound data matching one or more keywords from a local natural language unit library of the local natural language unit;
determine, via the local natural language unit, an intent of the first voice input based on at least one of the one or more keywords;
perform a first command according to the determined intent of the first voice input;
detect a second input to the at least one physical control; and
after detection of the second input to the at least one physical control, enable a second mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the second mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) enable voice input processing via the cloud-based voice assistant service and (b) disable voice input processing via the local natural language unit.
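The mode switching recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. All names here (`Mode`, `NetworkMicrophoneDevice`, `KEYWORD_LIBRARY`, `on_physical_control`, `handle_voice_input`) are hypothetical, and so are the keyword-to-intent entries. The sketch makes three simplifying assumptions: successive inputs to the physical control alternate between the two modes, a text transcript stands in for captured sound data, and the cloud-based voice assistant path is not modelled.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Mode(Enum):
    LOCAL = "local"  # first mode: local NLU enabled, cloud VAS disabled
    CLOUD = "cloud"  # second mode: cloud VAS enabled, local NLU disabled


# Hypothetical local NLU library: keyword -> intent (illustrative entries only).
KEYWORD_LIBRARY = {
    "play": "playback.start",
    "pause": "playback.pause",
    "louder": "volume.up",
}


@dataclass
class NetworkMicrophoneDevice:
    mode: Mode = Mode.CLOUD  # assumed starting mode
    performed: List[str] = field(default_factory=list)

    @property
    def cloud_vas_enabled(self) -> bool:
        return self.mode is Mode.CLOUD

    @property
    def local_nlu_enabled(self) -> bool:
        return self.mode is Mode.LOCAL

    def on_physical_control(self) -> None:
        """Assumption: each input to the physical control flips the mode."""
        self.mode = Mode.LOCAL if self.mode is Mode.CLOUD else Mode.CLOUD

    def handle_voice_input(self, transcript: str) -> Optional[str]:
        """Match keywords locally and record the resulting intent.

        Returns None when the local NLU is disabled or no keyword matches.
        """
        if not self.local_nlu_enabled:
            return None  # cloud VAS handling is outside this sketch
        for word in transcript.lower().split():
            intent = KEYWORD_LIBRARY.get(word)
            if intent is not None:
                self.performed.append(intent)  # stands in for performing the command
                return intent
        return None
```

In this sketch, the two enable/disable pairs of the claim collapse into a single `mode` field, so the cloud service and the local natural language unit can never both be active at once.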