US 12,236,947 B2
	Flexible-format voice command
Bart D'hoore, Merelbeke (BE); Christoph Halboth, Aachen (DE); Holger Quast, Merelbeke (BE); Dino Seppi, Merelbeke (BE); Markus Funk, Ulm (DE); Tom Claes, Merelbeke (BE); and Christophe Ris, Merelbeke (BE)
Assigned to Cerence Operating Company, Burlington, MA (US)
Filed by Cerence Operating Company, Burlington, MA (US)
Filed on Jul. 10, 2023, as Appl. No. 18/219,906.
Application 18/219,906 is a continuation of application No. 17/239,894, filed on Apr. 26, 2021, granted, now 11,735,172.
Prior Publication US 2024/0046924 A1, Feb. 8, 2024
Int. Cl. G10L 15/197 (2013.01); G10L 15/06 (2013.01)

CPC G10L 15/197 (2013.01) [G10L 15/063 (2013.01)]

19 Claims

1. A method for processing voice commands from a user, the method comprising:

receiving a first audio input acquired while the user utters a first utterance;

receiving a first video input including video of the user acquired in conjunction with acquiring the first audio input;

determining that the first utterance includes a command directed to a system based at least in part on

processing the first audio input, and

processing the first video input including identifying a visual characteristic associated with the user uttering the first utterance; and

causing the system to act on the command after determining that the first utterance includes the command directed to the system.