US 11,715,471 B2
	Systems, methods, and storage media for performing actions based on utterance of a command
Sanket Agarwal, San Francisco, CA (US)
Assigned to Suki AI, Inc., Redwood City, CA (US)
Filed by Suki AI, Inc., Redwood City, CA (US)
Filed on Oct. 20, 2021, as Appl. No. 17/506,473.
Application 17/506,473 is a continuation of application No. 16/526,140, filed on Jul. 30, 2019, granted, now 11,176,939.
Prior Publication US 2022/0044681 A1, Feb. 10, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 21/00 (2013.01); G10L 15/30 (2013.01); G10L 15/22 (2006.01); G10L 15/08 (2006.01)

CPC G10L 15/22 (2013.01) [G10L 15/08 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)]

10 Claims

1. A system configured to recognize and execute spoken commands using speech recognition, the system comprising:

electronic storage media configured to store actionable phrases, individual actionable phrases correlating to individual commands, wherein the commands are used during documentation;

one or more processors configured by machine-readable instructions to:

obtain audio information representing sound captured by a mobile client computing platform associated with a user;

detect any spoken instances of a predetermined keyword present in the sound represented by the audio information;

perform speech recognition on the sound represented by the audio information;

responsive to detection of a spoken instance of the predetermined keyword present in the sound represented by the audio information, identify one or more utterances of actionable phrases in speech temporally adjacent to the spoken instance of the predetermined keyword that is present in the sound represented by the audio information;

responsive to detection of the spoken instance of the predetermined keyword present in the sound represented by the audio information and responsive to not identifying the one or more utterances of the actionable phrases in speech temporally adjacent to the spoken instance of the predetermined keyword that is present in the sound represented by the audio information, perform natural language processing to identify individual commands uttered temporally adjacent to the spoken instance of the predetermined keyword that is present in the sounds represented by the audio information; and

effectuate performance of instructions corresponding to the individual commands.