CPC G10L 15/22 (2013.01) [G06F 3/0482 (2013.01); G06F 3/04842 (2013.01); G06F 3/167 (2013.01); G10L 15/1815 (2013.01); G10L 15/1822 (2013.01); G10L 15/26 (2013.01); G10L 17/22 (2013.01)] | 20 Claims |
1. A network microphone device comprising:
at least one physical control;
one or more microphones;
a network interface;
at least one processor; and
a housing carrying the one or more microphones, the network interface, the at least one processor, and data storage including instructions that are executable by the at least one processor such that the network microphone device is configured to:
detect a first input to the at least one physical control;
after detection of the first input to the at least one physical control, enable a first mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the first mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) disable voice input processing via a cloud-based voice assistant service and (b) enable voice input processing via a local natural language unit;
while the first mode is enabled: (i) capture sound data associated with a first voice input via the one or more microphones and (ii) detect, via the local natural language unit, that the first voice input comprises sound data matching one or more keywords from a local natural language unit library of the local natural language unit;
determine, via the local natural language unit, an intent of the first voice input based on at least one of the one or more keywords;
perform a first command according to the determined intent of the first voice input;
detect a second input to the at least one physical control; and
after detection of the second input to the at least one physical control, enable a second mode, wherein the instructions that are executable by the at least one processor such that the network microphone device is configured to enable the second mode comprise instructions that are executable by the at least one processor such that the network microphone device is configured to: (a) enable voice input processing via the cloud-based voice assistant service and (b) disable voice input processing via the local natural language unit.
|