US 12,236,947 B2
Flexible-format voice command
Bart D'hoore, Merelbeke (BE); Christoph Halboth, Aachen (DE); Holger Quast, Merelbeke (BE); Dino Seppi, Merelbeke (BE); Markus Funk, Ulm (DE); Tom Claes, Merelbeke (BE); and Christophe Ris, Merelbeke (BE)
Assigned to Cerence Operating Company, Burlington, MA (US)
Filed by Cerence Operating Company, Burlington, MA (US)
Filed on Jul. 10, 2023, as Appl. No. 18/219,906.
Application 18/219,906 is a continuation of application No. 17/239,894, filed on Apr. 26, 2021, granted, now 11,735,172.
Prior Publication US 2024/0046924 A1, Feb. 8, 2024
Int. Cl. G10L 15/197 (2013.01); G10L 15/06 (2013.01)
CPC G10L 15/197 (2013.01) [G10L 15/063 (2013.01)]
19 Claims
1. A method for processing voice commands from a user, the method comprising:
receiving a first audio input acquired while the user utters a first utterance;
receiving a first video input including video of the user acquired in conjunction with acquiring the first audio input;
determining that the first utterance includes a command directed to a system based at least in part on
processing the first audio input, and
processing the first video input including identifying a visual characteristic associated with the user uttering the first utterance; and
causing the system to act on the command after determining that the first utterance includes the command directed to the system.
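
The steps recited in claim 1 can be illustrated with a minimal sketch. This is not the patented implementation: the claim does not say how the audio or video is processed, so every name below is hypothetical (`AudioInput`, `VideoInput`, `facing_system`, `audio_score`, `visual_score`, `process_utterance`). The command word list, the 0.5 threshold, and the choice of "user is facing the system" as the visual characteristic are all assumptions made for illustration. The audio input is assumed to have already been transcribed by an upstream recognizer.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, Sequence


@dataclass
class AudioInput:
    # Assumption: an upstream recognizer has already produced a transcript.
    transcript: str


@dataclass
class VideoInput:
    # Assumption: one flag per video frame, True when the user appears to
    # face the system while speaking (a stand-in "visual characteristic").
    facing_system: Sequence[bool]


def audio_score(audio: AudioInput, command_words: FrozenSet[str]) -> float:
    """Return 1.0 if the transcript contains a known command word, else 0.0."""
    words = audio.transcript.lower().split()
    return 1.0 if any(word in command_words for word in words) else 0.0


def visual_score(video: VideoInput) -> float:
    """Return the fraction of frames in which the user faces the system."""
    frames = video.facing_system
    return sum(frames) / len(frames) if frames else 0.0


def process_utterance(
    audio: AudioInput,
    video: VideoInput,
    act: Callable[[str], None],
    command_words: FrozenSet[str] = frozenset({"navigate", "call", "play"}),
    threshold: float = 0.5,  # illustrative value, not taken from the patent
) -> bool:
    """Act on the utterance only if audio and video both indicate a command."""
    directed = (
        audio_score(audio, command_words) > 0.0
        and visual_score(video) >= threshold
    )
    if directed:
        # The system acts only after the determination has been made.
        act(audio.transcript)
    return directed
```

In this sketch, calling `process_utterance(AudioInput("play some jazz"), VideoInput([True, True, False]), handler)` invokes `handler`, while the same transcript paired with frames in which the user never faces the system does not. A real system would presumably replace both scoring functions with trained models and might combine their outputs in some other way.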