US 12,315,510 B2
	Multiple digital assistant coordination in vehicular environments
Haris Ramic, San Francisco, CA (US); Vikram Aggarwal, Palo Alto, CA (US); Moises Morgenstern Gali, San Francisco, CA (US); and Brandon Stuut, Palo Alto, CA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by GOOGLE LLC, Mountain View, CA (US)
Filed on Jul. 18, 2024, as Appl. No. 18/777,134.
Application 18/777,134 is a continuation of application No. 18/125,583, filed on Mar. 23, 2023, granted, now 12,073,834.
Application 18/125,583 is a continuation of application No. 17/861,615, filed on Jul. 11, 2022, granted, now 11,646,029, issued on May 9, 2023.
Application 17/861,615 is a continuation of application No. 16/609,491, granted, now 11,410,648, issued on Aug. 9, 2022, previously published as PCT/US2017/054846, filed on Oct. 3, 2017.
Prior Publication US 2024/0404522 A1, Dec. 5, 2024
Int. Cl. G10L 15/22 (2006.01); G06F 9/445 (2018.01); G06F 9/451 (2018.01); G10L 15/08 (2006.01); G10L 15/18 (2013.01); G10L 15/30 (2013.01)

CPC G10L 15/22 (2013.01) [G06F 9/44505 (2013.01); G06F 9/44536 (2013.01); G06F 9/452 (2018.02); G10L 15/1822 (2013.01); G10L 15/30 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)]

17 Claims

8. A system comprising:

at least one processor; and

memory storing instructions that, when executed, cause the at least one processor to be operable to:

receive, via one or more microphones of a computing device, an input audio signal that includes: (i) a first portion capturing a given trigger word or phrase, and (ii) a second portion capturing a request;

determine, based on processing the first portion of the input audio signal that includes the given trigger word or phrase, a given digital assistant application, from among a plurality of digital assistant applications that are accessible by the computing device, that is associated with the given trigger word or phrase,

wherein the plurality of digital assistant applications that are accessible by the computing device include at least a first digital assistant application and a second digital assistant application,

determine, based on processing the second portion of the input audio signal that includes the request, an action to be performed responsive to the request;

generate, based on the action to be performed responsive to the request, an action data structure including instructions that, when executed, cause the action to be performed responsive to the request;

in response to determining that the given trigger word or phrase is a first trigger word or phrase that is associated with the first digital assistant application:

cause the first digital assistant application to perform the action to satisfy the request, wherein the instructions to cause the first digital assistant application to perform the action to satisfy the request comprise instructions to cause the first digital assistant application to:

receive the action data structure via a first application programming interface (API) associated with the first digital assistant application; and

execute the instructions, included in the action data structure, to cause the action to be performed responsive to the request; and

in response to determining that the given trigger word or phrase is a second trigger word or phrase that is associated with the second digital assistant application:

cause the second digital assistant application to perform the action to satisfy the request, wherein the instructions to cause the second digital assistant application to perform the action to satisfy the request comprise instructions to cause the second digital assistant application to:

receive the action data structure via a second API associated with the second digital assistant application; and

execute the instructions, included in the action data structure, to cause the action to be performed responsive to the request.