US 12,451,127 B2
Speech processing dialog management
Lambert Leo Mathias, Seattle, WA (US); Bala Murali Krishna Ummaneni, Bellevue, WA (US); Ryan Scott Aldrich, Seattle, WA (US); Diamond Bishop, Seattle, WA (US); Ruhi Sarikaya, Redmond, WA (US); and Chetan Nagaraj Naik, Bellevue, WA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Nov. 17, 2020, as Appl. No. 17/099,875.
Application 17/099,875 is a continuation of application No. 15/934,199, filed on Mar. 23, 2018, granted, now 10,878,808.
Claims priority of provisional application 62/615,111, filed on Jan. 9, 2018.
Prior Publication US 2021/0142794 A1, May 13, 2021
Int. Cl. G10L 15/18 (2013.01); G06F 16/9032 (2019.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01); G10L 15/22 (2006.01)
CPC G10L 15/1815 (2013.01) [G06F 16/90332 (2019.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving input data corresponding to a natural language user input, the input data corresponding to a user profile;
receiving dialog history data corresponding to the user profile;
determining the dialog history data is associated with a dialog session including a plurality of prior dialog exchanges, wherein a dialog exchange comprises a natural language input and corresponding system response;
determining that a first dialog exchange, of the prior dialog exchanges, is associated with the natural language user input;
determining that a second dialog exchange, of the prior dialog exchanges, is associated with the natural language user input;
determining first data representing the first dialog exchange;
determining second data representing the second dialog exchange;
processing at least the first data and the second data using an encoding component to generate first encoded data representing at least the first dialog exchange, the second dialog exchange, and an additional dialog exchange associated with the dialog history data;
identifying context data associated with the dialog history data;
processing the context data to generate second encoded data;
processing the input data, the first encoded data, and the second encoded data to generate third data;
based at least in part on the third data, selecting an application from a plurality of applications to determine a response to the natural language user input;
sending at least a portion of the input data to the application;
generating, based at least in part on a portion of the input data, response data; and
outputting the response data in response to the natural language user input.