US 12,190,873 B2
Determining whether speech input is intended for a digital assistant
Ahmed S. Hussen Abdelaziz, San Ramon, CA (US); Saurabh Adya, San Jose, CA (US); Alexander W. Churchill, London (GB); Pranay Dighe, Berkeley, CA (US); Sachin S. Kajarekar, Sunnyvale, CA (US); Chaitanya Mannemala, San Ramon, CA (US); Erik Marchi, Zurich (CH); Seyedmahdad Mirsamadi, Santa Clara, CA (US); Ognjen Rudovic, Seattle, WA (US); Ahmed H. Tewfik, Los Altos, CA (US); Barry-John Theobald, San Jose, CA (US); and Srikanth Vishnubhotla, Santa Clara, CA (US)
Assigned to Apple Inc., Cupertino, CA (US)
Filed by Apple Inc., Cupertino, CA (US)
Filed on Sep. 23, 2022, as Appl. No. 17/952,005.
Claims priority of provisional application 63/341,893, filed on May 13, 2022.
Prior Publication US 2023/0368783 A1, Nov. 16, 2023
Int. Cl. G10L 15/22 (2006.01); G06T 7/70 (2017.01); G06V 40/16 (2022.01); G10L 15/16 (2006.01); G10L 15/197 (2013.01); G10L 25/78 (2013.01); G10L 15/08 (2006.01)

CPC G10L 15/197 (2013.01) [G06T 7/70 (2017.01); G06V 40/161 (2022.01); G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 25/78 (2013.01); G06T 2207/30201 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)]

45 Claims

1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to:

receive a first speech input representing a first user utterance;

initiate, by a digital assistant operating on the electronic device, a first task based on the first speech input;

provide a first output indicative of the initiated first task; and

after providing the first output:

receive a second speech input following the first speech input, the second speech input representing a second user utterance;

determine, based on a textual representation of the second speech input, a first score representing a correspondence between the second user utterance and a domain recognized by the digital assistant;

determine, based on the textual representation of the second speech input, a second score representing contextual continuity between the first user utterance and the second user utterance;

determine, based on the first score and the second score, whether the second speech input is intended for the digital assistant; and

in accordance with a determination that the second speech input is intended for the digital assistant:

initiate, by the digital assistant, a second task based on the second speech input; and

provide a second output indicative of the initiated second task.
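
The decision flow recited in claim 1 can be illustrated with a minimal sketch. The claim does not specify how either score is computed or how the two are combined, so everything below is an assumption made for illustration only: the keyword table `DOMAIN_KEYWORDS`, the token-overlap measures, the equal weights, and the `0.3` threshold are all hypothetical and are not taken from the patent. A real system would presumably use trained models in place of these toy heuristics.

```python
from dataclasses import dataclass

# Hypothetical toy domains; not taken from the patent.
DOMAIN_KEYWORDS = {
    "weather": {"weather", "rain", "forecast", "temperature", "tomorrow"},
    "timer": {"timer", "alarm", "minutes", "remind"},
}


@dataclass
class FollowUpDecision:
    first_score: float    # correspondence with a recognized domain
    second_score: float   # contextual continuity with the first utterance
    combined: float
    intended_for_assistant: bool


def tokenize(text: str) -> set[str]:
    """Lowercase, strip basic punctuation, and return the set of words."""
    words = {w.strip(".,?!").lower() for w in text.split()}
    return words - {""}


def domain_score(second_text: str) -> float:
    """Assumed stand-in for the first score: best keyword overlap per domain."""
    tokens = tokenize(second_text)
    if not tokens:
        return 0.0
    return max(len(tokens & kw) / len(tokens) for kw in DOMAIN_KEYWORDS.values())


def continuity_score(first_text: str, second_text: str) -> float:
    """Assumed stand-in for the second score: Jaccard overlap of the two texts."""
    a, b = tokenize(first_text), tokenize(second_text)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def decide(first_text: str, second_text: str,
           w_domain: float = 0.5, w_continuity: float = 0.5,
           threshold: float = 0.3) -> FollowUpDecision:
    """Combine both scores (assumed weighted sum) and compare to a threshold."""
    s1 = domain_score(second_text)
    s2 = continuity_score(first_text, second_text)
    combined = w_domain * s1 + w_continuity * s2
    return FollowUpDecision(s1, s2, combined, combined >= threshold)


if __name__ == "__main__":
    first = "What is the weather today?"
    for follow_up in ("And the weather tomorrow?", "Did you feed the dog?"):
        d = decide(first, follow_up)
        action = "initiate second task" if d.intended_for_assistant else "ignore"
        print(f"{follow_up!r}: {action}")
```

In this sketch, only a follow-up judged to be intended for the assistant would lead to the second task being initiated and a second output being provided; other speech is ignored.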